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1. PREFACE v 

1. Preface 

These notes are the outgrowth of a graduate course on Lie groups I taught 
at the University of Virginia in 1994. In trying to find a text for the course I 
discovered that books on Lie groups either presuppose a knowledge of differentiable 
manifolds or provide a mini-course on them at the beginning. Since my students 
did not have the necessary background on manifolds, I faced a dilemma: either use 
manifold techniques that my students were not familiar with, or else spend much 
of the course teaching those techniques instead of teaching Lie theory. To resolve 
this dilemma I chose to write my own notes using the notion of a matrix Lie group. 
A matrix Lie group is simply a closed subgroup of GL(n;C). Although these are 
often called simply "matrix groups," my terminology emphasizes that every matrix 
group is a Lie group. 

This approach to the subject allows me to get started quickly on Lie group the- 
ory proper, with a minimum of prerequisites. Since most of the interesting examples 
of Lie groups are matrix Lie groups, there is not too much loss of generality. Fur- 
thermore, the proofs of the main results arc ultimately similar to standard proofs 
in the general setting, but with less preparation. 

Of course, there is a price to be paid and certain constructions (e.g. covering 
groups) that are easy in the Lie group setting are problematic in the matrix group 
setting. (Indeed the universal cover of a matrix Lie group need not be a matrix 
Lie group.) On the other hand, the matrix approach suffices for a first course. 
Anyone planning to do research in Lie group theory certainly needs to learn the 
manifold approach, but even for such a person it might be helpful to start with a 
more concrete approach. And for those in other fields who simply want to learn 
the basics of Lie group theory, this approach allows them to do so quickly. 

These notes also use an atypical approach to the theory of semisimple Lie 
algebras, namely one that starts with a detailed calculation of the representations 
of sl(3;C). My own experience was that the theory of Cartan subalgebras, roots, 
Weyl group, etc., was pretty difficult to absorb all at once. I have tried, then, to 
motivate these constructions by showing how they are used in the representation 
theory of the simplest representative Lie algebra. (I also work out the case of 
sl(2;C), but this case does not adequately illustrate the general theory.) 

In the interests of making the notes accessible to as wide an audience as possible, 
I have included a very brief introduction to abstract groups, given in Chapter 1. 
In fact, not much of abstract group theory is needed, so the quick treatment I give 
should be sufficient for those who have not seen this material before. 

I am grateful to many who have made corrections, large and small, to the notes, 
including especially Tom Goebeler, Ruth Gornet, and Erdinch Tatar. 
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CHAPTER 1 



Groups 

1. Definition of a Group, and Basic Properties 

Definition 1.1. A group is a set G, together with a map of G x G into G 
(denoted g\ * g2 ) with the following properties: 
First, associativity: for all g\,g2 € G, 

(1-1) 9i * (92 * 93) = (gi *92) *53- 

Second, there exists an element e in G such that for all g e G, 

(1.2) g*e = e* g = g. 
and such that for all g e G, there exists h e G with 

(1.3) g * h = h* g = e. 

Ifg*h = h*g for all g, h e G, then the group is said to be commutative (or 
abelian). 

The element e is (as we shall see momentarily) unique, and is called the iden- 
tity element of the group, or simply the identity. Part of the definition of a 
group is that multiplying a group element g by the identity on either the right or 
the left must give back g. 

The map of G x G into G is called the product operation for the group. Part 
of the definition of a group G is that the product operation map G x G into G, i.e., 
that the product of two elements of G be again an element of G. This property is 
referred to as closure. 

Given a group element g, a group element h such that g* h = h* g = e is called 
an inverse of g. We shall see momentarily that each group element has a unique 
inverse. 

Given a set and an operation, there are four things that must be checked to show 
that this is a group: closure, associativity, existence of an identity, and existence of 
inverses. 

PROPOSITION 1.2 (Uniqueness of the Identity). LetG be a group, andlete,f e 
G be such that for all g G G 

e*g = g*e = g 
f * 9 = 9* f = 9- 

Then e = /. 

PROOF. Since e is an identity, we have 

e * / = /• 
l 
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On the other hand, since / is an identity, we have 

e * f — e. 

Thus e = e * / = /. □ 

Proposition 1.3 (Uniqueness of Inverses). Let G be a group, e the (unique) 
identity of G, and g, h, k arbitrary elements of G. Suppose that 

g*h = h*g = e 
g*k = k*g = e. 

Then h = k. 



Proof. We know that g * h — g * k 

h* (g * h) -- 

By associativity, this gives 

(h* g) *h - 
and so 



(= e). Multiplying on the left by h gives 
= h * (g * k). 

= (h* g) * k, 



e * h = e * k 
h = k. 

□ 

This is what we wanted to prove. 

Proposition 1.4. Let G be a group, e the identity element of G, and g an 
arbitrary element of G. Suppose h € G satisfies either h* g — e or g * h — e. Then 
h is the (unique) inverse of g. 

Proof. To show that h is the inverse of <?, we must show both that h * g = e 
and g * h — e. Suppose we know, say, that h* g — e. Then our goal is to show that 
this implies that g * h = e. 

Since h* g = e, 

g*(h*g) = g*e = g. 

By associativity, we have 

(.9 *h)*g = g. 

Now, by the definition of a group, g has an inverse. Let k be that inverse. (Of 
course, in the end, we will conclude that k = h, but we cannot assume that now.) 
Multiplying on the right by k and using associativity again gives 

((g *h)*g)*k = g*k = e 

(g * h) * (g * k) = e 

(g * h) * e = e 

g * h = e. 

A similar argument shows that if g * h — e, then h * g = e. □ 

Note that in order to show that h * g = e implies g * h — e, we used the fact 
that g has an inverse, since it is an element of a group. In more general contexts 
(that is, in some system which is not a group), one may have h * g — e but not 
g * h — e. (See Exercise |TT| .) 
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Notation 1.5. For any group element g, its unique inverse will be denoted 



g- 1 - 

Proposition 1.6 (Properties of Inverses). LetG be a group, e its identity, and 
g,h arbitrary elements ofG. Then 

(g-r^a 

(gh)- 1 = h-'g- 1 

Proof. Exercise. □ 

2. Some Examples of Groups 

From now on, we will denote the product of two group elements g\ and 92 
simply by 9192, instead of the more cumbersome g\ * 92- Moreover, since we have 
associativity, we will write simply #13293 in place of (31(72)93 or 91(9293)- 

2.1. The trivial group. The set with one element, e, is a group, with the 
group operation being defined as ee = e. This group is commutative. 



Associativity is automatic, since both sides of (1.1) must be equal to e. Of 
course, e itself is the identity, and is its own inverse. Commutativity is also auto- 
matic. 

2.2. The integers. The set Z of integers forms a group with the product 
operation being addition. This group is commutative. 

First, we check closure, namely, that addition maps Z x Z into Z, i.e., that the 
sum of two integers is an integer. Since this is obvious, it remains only to check 
associativity, identity, and inverses. Addition is associative; zero is the additive 
identity (i.e., + n = n + = n, for all n € Z); each integer n has an additive 
inverse, namely, —n. Since addition is commutative, Z is a commutative group. 

2.3. The reals and M. n . The set R of real numbers also forms a group under 
the operation of addition. This group is commutative. Similarly, the n-dimensional 
Euclidean space W 1 forms a group under the operation of vector addition. This 
group is also commutative. 

The verification is the same as for the integers. 

2.4. Non-zero real numbers under multiplication. The set of non-zero 
real numbers forms a group with respect to the operation of multiplication. This 
group is commutative. 

Again we check closure: the product of two non-zero real numbers is a non-zero 
real number. Multiplication is associative; one is the multiplicative identity; each 
non-zero real number x has a multiplicative inverse, namely, ^. Since multiplication 
of real numbers is commutative, this is a commutative group. 

This group is denoted R*. 

2.5. Non-zero complex numbers under multiplication. The set of non- 
zero complex numbers forms a group with respect to the operation of complex 
multiplication. This group is commutative. 

This group in denoted C*. 
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2.6. Complex numbers of absolute value one under multiplication. 

The set of complex numbers with absolute value one (i.e., of the form e 10 ) forms a 
group under complex multiplication. This group is commutative. 
This group is the unit circle, denoted S . 

2.7. Invertible matrices. For each positive integer n, the set of all n x n 

invertible matrices with real entries forms a group with respect to the operation of 
matrix multiplication. This group in non-commutative, for n > 2. 

We check closure: the product of two invertible matrices is invertible, since 
(AB) 1 = B~ 1 A~ 1 . Matrix multiplication is associative; the identity matrix (with 
ones down the diagonal, and zeros elsewhere) is the identity element; by definition, 
an invertible matrix has an inverse. Simple examples show that the group is non- 
commutative, except in the trivial case n = 1. (See Exercise ||.) 

This group is called the general linear group (over the reals), and is denoted 
GL(n;R). 

2.8. Symmetric group (permutation group). The set of one-to-one, onto 
maps of the set {1, 2, • • • n} to itself forms a group under the operation of compo- 
sition. This group is non-commutative for n > 3. 

We check closure: the composition of two one-to-one, onto maps is again one- 
to-one and onto. Composition of functions is associative; the identity map (which 
sends 1 to 1, 2 to 2, etc.) is the identity element; a one-to-one, onto map has an 
inverse. Simple examples show that the group is non-commutative, as long as n is 
at least 3. (See Exercise |l0|.) 

This group is called the symmetric group, and is denoted S n . A one-to-one, 
onto map of {1, 2, • • • n} is a permutation, and so S n is also called the permutation 
group. The group S n has n\ elements. 

2.9. Integers mod n. The set {0, 1, • • -n — 1} forms a group under the oper- 
ation of addition mod n. This group is commutative. 

Explicitly, the group operation is the following. Consider a,b € {0,1- ■ -n — 1}. 
If a + b < n, then a + b mod n = a + fe, if a + b > n, then a + b mod n = a + b — n. 
(Since a and b are less than n, a+b—n is less than n; thus we have closure.) To show 
associativity, note that both (a + b mod n) + c mod n and a+(b + c mod n) mod n 
are equal to a + b + c, minus some multiple of n, and hence differ by a multiple of 
n. But since both are in the set {0, 1, • • • n — 1}, the only possible multiple on n 
is zero. Zero is still the identity for addition mod n. The inverse of an element 
a 6 {0, 1, • • • n — 1} is n — a. (Exercise: check that n — a is in {0, 1, • • • n — 1}, and 
that a + (n — a) mod n = 0.) The group is commutative because ordinary addition 
is commutative. 

This group is referred to as "Z mod n," and is denoted Z„. 

3. Subgroups, the Center, and Direct Products 

Definition 1.7. A subgroup of a group G is a subset H of G with the follow- 
ing properties: 

1. The identity is an element of H . 

2. If he H, then h- 1 e H. 

3. Ifhi,Ii2 € H, then h\hi G H . 



4. HOMOMORPHISMS AND ISOMORPHISMS 
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The conditions on H guarantee that H is a group, with the same product 
operation as G (but restricted to H ) . Closure is assured by (j|) , associativity follows 
from associativity in G, and the existence of an identity and of inverses is assured 
by (|) and (§. ' 

3.1. Examples. Every group G has at least two subgroups: G itself, and the 
one-element subgroup {e}. (If G itself is the trivial group, then these two subgroups 
coincide.) These are called the trivial subgroups of G. 

The set of even integers is a subgroup of Z: zero is even, the negative of an 
even integer is even, and the sum of two even integers is even. 

The set H ofnxii real matrices with determinant one is a subgroup of GL(n; K) . 
The set H is a subset of GL(n; R) because any matrix with determinant one is invert- 
ible. The identity matrix has determinant one, so [I] is satisfied. The determinant of 
the inverse is the reciprocal of the determinant, so || is satisfied; and the determi- 
nant of a product is the product of the determinants, so || is satisfied. This group 
is called the special linear group (over the reals), and is denoted SL(n;R). 

Additional examples, as well as some non-examples, are given in Exercise |^. 

Definition 1.8. The center of a group G is the set of all g S G such that 
gh = hg for all h G G. 

It is not hard to see that the center of any group G is a subgroup G. 

Definition 1.9. Let G and H be groups, and consider the Cartesian product 
of G and H , i.e., the set of ordered pairs (<?, h) with g 6 G, h G H . Define a product 
operation on this set as follows: 

(gi,h 1 )(g 2 ,h 2 ) = (gig 2 ,h 1 h 2 ). 

This operation makes the Cartesian product of G and H into a group, called the 
direct product of G and H and denoted G x H. 

It is a simple matter to check that this operation truly makes G x H into a 
group. For example, the identity element of G x H is the pair (e\, e 2 ), where e\ is 
the identity for G, and e 2 is the identity for H . 

4. Homomorphisms and Isomorphisms 

Definition 1.10. Let G and H be groups. A map : G — > H is called a 
homomorphism if 4>(gig 2 ) = 4>{9i)4>{92) for all g\,g 2 G G. If in addition, <f> is 
one-to-one and onto, then (j) is called an isomorphism. An isomorphism of a 
group with itself is called an automorphism. 

Proposition 1.11. Let G and H be groups, e\ the identity element of G, and 
e 2 the identity element of H. If <fi : G — > H is a homomorphism, then </>( e i) = e 2, 
and 4>{g~ l ) = 0(g) -1 for all g G G. 

PROOF. Let g be any element of G. Then <fi(g) — (j>(ge\) — cf>(g)cf>(ei). Mul- 
tiplying on the left by ^(g) -1 gives e 2 = </>(ei). Now consider (/>(<7 _1 ). Since 



4>{e\) = e 2 , we have e 2 = <j>{e{) — 4>{gg ) = 4>{g)4>{g )■ In light of Prop. [L4 , we 



conclude that <f>(g x ) is the inverse of 4>(g). □ 

Definition 1.12. Let G and H be groups, <f> : G — > H a homomorphism, and 
e 2 the identity element of H . The kernel of <p is the set of all g G G for which 
<t>{9) = ea- 
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Proposition 1.13. Let G and H be groups, and <j) : G — > H a homomorphism. 
Then the kernel of 4> is a subgroup ofG. 

Proof. Easy. □ 

4.1. Examples. Given any two groups G and H, we have the trivial homo- 
morphism from G to H: <fr(g) = e for all g £ G. The kernel of this homomorphism 
is all of G. 

In any group G, the identity map (id(g) = g) is an automorphism of G, whose 
kernel is just {e}. 

Let G = H = Z, and define 4>(n) = 2n. This is a homomorphism of Z to itself, 
but not an automorphism. The kernel of this homomorphism is just {0}. 

The determinant is a homomorphism of GL(n, R) to R*. The kernel of this map 
is SL(n,R). 

Additional examples are given in Exercises [l2] and [?]. 

If there exists an isomorphism from G to H, then G and H are said to be 
isomorphic, and this relationship is denoted G = H. (See Exercise ||.) Two groups 
which arc isomorphic should be thought of as being (for all practical purposes) the 
same group. 



5. Exercises 

Recall the definitions of the groups GL(n;R\S n , R*, and Z„ from Sect. § and 
the definition of the group SL(n;R) from Sect. |3| 

1. Show that the center of any group G is a subgroup G. 

2. In (a)-(f), you are given a group G and a subset H of G. In each case, 
determine whether H is a subgroup of G. 

(a) G = Z, H = {odd integers} 

(b) G = Z, H = {multiples of 3} 

(c) G = GL(n; R), H = {A € GL(n; R) \dct A is an integer} 

(d) G = SL(n; R), H = {A e SL{n; R) |all the entries of A are integers} 
Hint: recall Kramer's rule for finding the inverse of a matrix. 

(c) G = GL(n;R), H = {A £ GL(n;R) |all of the entries of A are rational} 
(f) G = Z 9 , H = {0,2,4,6,8} 



3. Verify the properties of inverses in Prop. 1.6. 

4. Let G and H be groups. Suppose there exists an isomorphism <j) from G to 
H . Show that there exists an isomorphism from H to G. 

5. Show that the set of positive real numbers is a subgroup of R*. Show that 
this group is isomorphic to the group R. 

6. Show that the set of automorphisms of any group G is itself a group, under 
the operation of composition. This group is the automorphism group of 
G, Aut(G). 

7. Given any group G, and any element g in G, define </> s : G — * G by </> s (/i) = 
ghg" 1 . Show that <f> g is an automorphism of G. Show that the map g — > 4> g 
is a homomorphism of G into Aut(G), and that the kernel of this map is the 
center of G. 

Note: An automorphism which can be expressed as <p g for some g G G 
is called an inner automorphism; any automorphism of G which is not 
equal to any 4> g is called an outer automorphism. 
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8. Give an example of two 2x2 invertible real matrices which do not commute. 
(This shows that GL(2,R) is not commutative.) 

9. Show that in any group G, the center of G is a subgroup. 

10. An element a of the permutation group S n can be written in two-row form, 

1 2 ••• n 
C\ 02 ■ • • cr n 
where Oi denotes a(i). Thus 

12 3 
2 3 1 

is the element of S3 which sends 1 to 2, 2 to 3, and 3 to 1. When multiplying 
(i.e., composing) two permutations, one performs the one on the right first, 
and then the one on the left. (This is the usual convention for composing 
functions.) 
Compute 



and 



Conclude that S3 is not commutative. 
11. Consider the set N= {0, 1, 2, • • • } of natural numbers, and the set T of all 
functions of N to itself. Composition of functions defines a map of J- x T 
into T ', which is associative. The identity (id(n) = n) has the property that 
id o f = f o id = /, for all / in T . However, since we do not restrict to 
functions which are one-to-one and onto, not every element of T has an 
inverse. Thus T is not a group. 

Give an example of two functions /, g in T such that f o g = id, but 
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.9 / 7^ (Compare with Prop. 1.4.) 



12. Consider the groups Z and Z„. For each a in Z, define a mod n to be the 
unique element b of {0, 1, • • • n — 1} such that a can be written as a = kn+b, 
with k an integer. Show that the map a — > a mod n is a homomorphism of 
Z into Z n . 

13. Let G be a group, and H a subgroup of G. H is called a normal subgroup 

of G if given any g £ G, and h £ H, ghg^ 1 is in 

Show that any subgroup of a commutative group is normal. Show that 
in any group G, the trivial subgroups G and {e} are normal. Show that the 
center of any group is a normal subgroup. Show that if <f> is a homomorphism 
from G to H, then the kernel of <f> is a normal subgroup of G. 

Show that SL(n;R) is a normal subgroup of GL(n;R). 

Note: a group G with no normal subgroups other than G and {e} is 
called simple. 
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CHAPTER 2 



Matrix Lie Groups 



1. Definition of a Matrix Lie Group 

Recall that the general linear group over the reals, denoted GL(n;M), is the 
group of all n x n invertible matrices with real entries. We may similarly define 
GL(n; C) to be the group of all n x n invertible matrices with complex entries. Of 
course, GL(n;R) is contained in GL(n;C). 

Definition 2.1. Let A n be a sequence of complex matrices. We say that A n 
converges to a matrix A if each entry of A n converges to the corresponding entry 
of A, i.e., if (An)- converges to Aij for all 1 < i, j < n. 



Definition 2.2. A matrix Lie group is any subgroup H o/GL(n; C) with the 
following property: if A n is any sequence of matrices in H , and A n converges to 
some matrix A, then either A S H , or A is not invertible. 

The condition on H amounts to saying that H is a closed subset of GL(n; C). 
(This is not the same as saying that H is closed in the space of all matrices.) Thus 



Definition 2.2 is equivalent to saying that a matrix Lie group is a closed subgroup 
ofGL(n;C). 

The condition that H be a closed subgroup, as opposed to merely a subgroup, 
should be regarded as a technicality, in that most of the interesting subgroups of 
GL(n; C) have this property. (Almost all of the matrix Lie groups H we will consider 
have the stronger property that if A n is any sequence of matrices in H, and A n 
converges to some matrix A, then A 6 H .) 

There is a topological structure on the set ofnxn complex matrices which 
goes with the above notion of convergence. This topological structure is defined by 

2 

identifying the space ofnxn matrices with C™ in the obvious way and using the 
usual topological structure on C™ . 

1.1. Counterexamples. An example of a subgroup of GL(n;C) which is not 
closed (and hence is not a matrix Lie group) is the set of all n x n invertible 
matrices all of whose entries are real and rational. This is in fact a subgroup of 
GL(rt;C), but not a closed subgroup. That is, one can (easily) have a sequence 
of invertible matrices with rational entries converging to an invertible matrix with 
some irrational entries. (In fact, every real invertible matrix is the limit of some 
sequence of invertible matrices with rational entries.) 

Another example of a group of matrices which is not a matrix Lie group is the 
following subgroup of GL(2,C). Let a be an irrational real number, and let 



<) 
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Clearly, H is a subgroup of GL(2,C). Because a is irrational, the matrix — / is not 
in H, since to make e equal to —1, we must take t to be an odd integer multiple 
of 7T, in which case ta cannot be an odd integer multiple of tt. On the other hand, 
by taking t = (2n + l)n for a suitably chosen integer n, we can make ta arbitrarily 
close to an odd integer multiple of tt. (It is left to the reader to verify this.) Hence 
we can find a sequence of matrices in H which converges to —I, and so H is not a 
matrix Lie group. See Exercise [l]. 

2. Examples of Matrix Lie Groups 

Mastering the subject of Lie groups involves not only learning the general the- 
ory, but also familiarizing oneself with examples. In this section, we introduce some 
of the most important examples of (matrix) Lie groups. 

2.1. The general linear groups GL(rt;R) and GL(rt;C). The general linear 
groups (over R or C) are themselves matrix Lie groups. Of course, GL(n;C) is a 
subgroup of itself. Furthermore, if A n is a sequence of matrices in GL(n; C) and A n 
converges to A, then by the definition of GL(n; C), either A is in GL(n; C), or A is 
not invertible. 

Moreover, GL(n;R) is a subgroup of GL(n;C), and if A n £ GL(n;R), and A n 
converges to A, then the entries of A are real. Thus either A is not invertible, or 
A e GL(n;R). 

2.2. The special linear groups SL(n; R) and SL(n; C). The special linear 
group (over R or C) is the group of n x n invertible matrices (with real or complex 
entries) having determinant one. Both of these are subgroups of GL(n; C), as noted 
in Chapter 1. Furthermore, if A n is a sequence of matrices with determinant one, 
and A n converges to A, then A also has determinant one, because the determinant 
is a continuous function. Thus SL(n;R) and SL(?i;C) are matrix Lie groups. 

2.3. The orthogonal and special orthogonal groups, 0(n) and SO(n). 
An n x n real matrix A is said to be orthogonal if the column vectors that make 
up A are orthonormal, that is, if 

a 

^AijAik = 5 jk 

i=l 

Equivalently, A is orthogonal if it preserves the inner product, namely, if (x, y) = 
(Ax, Ay) for all vectors x, y in R™. ( Angled brackets denote the usual inner product 
on R™, (x, y) = a^y,.) Still another equivalent definition is that A is orthogonal 
if A tr A = I, i.e., if A tr = A" 1 . (A tr is the transpose of A, (A* r ) y = A 3i .) See 
Exercise ||. 

Since detA tr = det^4, we see that if A is orthogonal, then det(A' r ^4) = 
(det A) 2 = detl — 1. Hence det A = ±1, for all orthogonal matrices A. 

This formula tells us, in particular, that every orthogonal matrix must be in- 
vertible. But if A is an orthogonal matrix, then 

(A-^A^y) = {A(A- 1 x) , A (j4 _1 a;)) = (x,y) 

Thus the inverse of an orthogonal matrix is orthogonal. Furthermore, the product 
of two orthogonal matrices is orthogonal, since if A and B both preserve inner 
products, then so does AB. Thus the set of orthogonal matrices forms a group. 
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The set of all n x n real orthogonal matrices is the orthogonal group 0(n), 
and is a subgroup of GL(n;C). The limit of a sequence of orthogonal matrices is 
orthogonal, because the relation A tr A = I is preserved under limits. Thus 0(n) is 
a matrix Lie group. 

The set ofnxn orthogonal matrices with determinant one is the special or- 
thogonal group SO(n). Clearly this is a subgroup of 0(n), and hence of GL(n; C). 
Moreover, both orthogonality and the property of having determinant one are pre- 
served under limits, and so SO(n) is a matrix Lie group. Since elements of 0(n) 
already have determinant ±1, SO(n) is "half" of 0(n). 

Geometrically, elements of 0(n) are either rotations, or combinations of rota- 
tions and reflections. The elements of SO(n) are just the rotations. 

See also Exercise [| 

2.4. The unitary and special unitary groups, U(n) and SU(n). An nxn 
complex matrix A is said to be unitary if the column vectors of A are orthonormal, 
that is, if 

n 

^ AijAik = Sjk 

i=l 

Equivalently, A is unitary if it preserves the inner product, namely, if (x,y) = 
(Ax, Ay) for all vectors x, y in C™. (Angled brackets here denote the inner product 
on C n , (x,y) = 'Ylii x ~iVi- We will adopt the convention of putting the complex 
conjugate on the left.) Still another equivalent definition is that A is unitary if 
A* A = I, i.e., if A* = A' 1 . (A* is the adjoint of A, {A*).j = A~.) See Exercise |. 

Since detA* = det A, we see that if A is unitary, then det (A* A) — | det A | 2 = 
det I = 1. Hence |det A\ = 1, for all unitary matrices A. 

This in particular shows that every unitary matrix is invertible. The same 
argument as for the orthogonal group shows that the set of unitary matrices forms 
a group. 

The set of all nxn unitary matrices is the unitary group U(n), and is a 
subgroup of GL(n; C). The limit of unitary matrices is unitary, so U(n) is a matrix 
Lie group. The set of unitary matrices with determinant one is the special unitary 
group SU(n). It is easy to check that SU(n) is a matrix Lie group. Note that a 
unitary matrix can have determinant e* e for any 9, and so SU(n) is a smaller subset 
of U(n) than SO(n) is of 0(n). (Specifically, SO(n) has the same dimension as 
0(n), whereas SU(n) has dimension one less than that of U(n).) 

See also Exercise |[ 

2.5. The complex orthogonal groups, 0(n;C) and SO(n;C). Consider 
the bilinear form ( ) on C™ defined by (x,y) — ^x^i. This form is not an inner 
product, because of the lack of a complex conjugate in the definition. The set of all 
nxn complex matrices A which preserve this form, (i.e., such that {Ax, Ay) — [x, y) 
for all x,y € C") is the complex orthogonal group 0(n;C), and is a subgroup 
of GL(n;C). (The proof is the same as for 0(n).) An nxn complex matrix A is 
in 0(n; C) if and only if A tr A = I. It is easy to show that 0(n; C) is a matrix Lie 
group, and that detA = ±1, for all A in 0(n;C). Note that 0(n;C) is not the 
same as the unitary group U(n). The group SO(n; C) is defined to be the set of all 
A in 0(n; C) with det A = 1. Then SO(n; C) is also a matrix Lie group. 
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2.6. The generalized orthogonal and Lorentz groups. Let n and k be 

positive integers, and consider R' i+fc . Define a symmetric bilinear form [ ] n+k on 
R n + fe by the formula 

(2.1) [x, y] nk = xiyi H h x„y n - x n+ iy n+ i y n +kx n +k 

The set of (n + k) x (n + k) real matrices A which preserve this form (i.e., such that 
[Ax, Ay] n k = [x,y\ n k for all x,y £ M. n+k ) is the generalized orthogonal group 
0(n; k), and it is a subgroup of GL(n + k; R) (Ex. Since 0(n; k) and 0(fc; n) arc 
essentially the same group, we restrict our attention to the case n > k. It is not 
hard to check that 0(n; k) is a matrix Lie group. 

If A is an (n + k) x (n + k) real matrix, let AW denote the i th column vector 
of A, that is 




Then A is in 0(n; fc) if and only if the following conditions are satisfied: 

i+j 

1 1 < i < n 
-1 n+l<i<n + k 

Let g denote the (n + k) x (n + fc) diagonal matrix with ones in the first n 
diagonal entries, and minus ones in the last k diagonal entries. Then A is in 0(n; k) 
if and only if A tr gA = g (Ex. Taking the determinant of this equation gives 
(det A) 2 det g = det g, or (det A) 2 = 1. Thus for any A in 0(n; fc), det A = ±1. 

The group SO(n; k) is defined to be the set of matrices in 0(n; k) with det A = 1. 
This is a subgroup of GL(n + fc; M), and is a matrix Lie group. 

Of particular interest in physics is the Lorentz group 0(3; 1). (Sometimes 
the phrase Lorentz group is used more generally to refer to the group 0(n; 1) for 
any n > 1.) See also Exercise (?| 

2.7. The symplectic groups Sp(rt;IR), Sp(n;C), and Sp(n). The special 
and general linear groups, the orthogonal and unitary groups, and the symplectic 
groups (which will be defined momentarily) make up the classical groups. Of the 
classical groups, the symplectic groups have the most confusing definition, partly 
because there are three sets of them (Sp(n;M), Sp(n;C), and Sp(n)), and partly 
because they involve skew-symmetric bilinear forms rather than the more familiar 
symmetric bilinear forms. To further confuse matters, the notation for referring to 
these groups is not consistent from author to author. 

Consider the skew-symmetric bilinear form B on M. 2n defined as follows: 

n 

(2.3) B [X, y] = ^ X iVn+% ~ Xn+iVi 

i=l 

The set of all 2n x 2n matrices A which preserve B (i.e., such that B [Ax, Ay] = 
B[x,y] for all x,y S R 2n ) is the real symplectic group Sp(n;R), and it is a 
subgroup of GL(2n;M). It is not difficult to check that this is a matrix Lie group 



(2.2) H (l U W L, fc = 

[^U (4) L, = 
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(Exercise ||). This group arises naturally in the study of classical mechanics. If J 
is the 2n x 2n matrix 



J 



I 

-I 



then B [x, y] = (x, Jy), and it is possible to check that a 2n x 2n real matrix A is in 
Sp(n; R) if and only if A tr JA = J. (See Exercise |[) Taking the determinant of this 
identity gives (det A) 2 det J — det J, or (det A) 2 = 1. This shows that det A = ±1, 
for all A £ Sp(n;R). In fact, detA = 1 for all A £ Sp(n;R), although this is not 
obvious. 

One can define a bilinear form on C n by the same formula ( |2.3[ ) . (This form is 
bilinear, not Hermitian, and involves no complex conjugates.) The set of 2n x 2n 
complex matrices which preserve this form is the complex symplectic group 
Sp(n;C). A 2n x 2n complex matrix A is in Sp(n;C) if and only if A tr JA = J. 
(Note: this condition involves A tr , not A*.) This relation shows that detA = ±1, 
for all A £ Sp(n;C). In fact det A = 1, for all A £ Sp(n;C). 

Finally, we have the compact symplectic group Sp(n) defined as 

Sp(n) = Sp(n;C) H U(2n). 

See also Exercise |9[ For more information and a proof of the fact that det A = 1 , 
for all A £ Sp(n; C), see Miller, Sect. 9.4. What we call Sp (n; C) Miller calls Sp(n), 
and what we call Sp(n), Miller calls USp(n). 

2.8. The Heisenberg group H. The set of all 3 x 3 real matrices A of the 
form 




(2.4) A = 

where a, b, and c are arbitrary real numbers, is the Heisenberg group. It is easy 



to check that the product of two matrices of the form (2.4) is again of that form, and 



clearly the identity matrix is of the form (2.4). Furthermore, direct computation 



shows that if A is as in (2.4), then 




A~ 



Thus H is a subgroup of GL(3;R). Clearly the limit of matrices of the form fl2.4|) 
is again of that form, and so H is a matrix Lie group. 

It is not evident at the moment why this group should be called the Heisenberg 
group. We shall see later that the Lie algebra of H gives a realization of the 
Heisenberg commutation relations of quantum mechanics. (See especially Chapter 
5, Exercise |l0| .) 

See also Exercise [l0| 

2.9. The groups W, C*, S 1 , R, and R™. Several important groups which 
are not naturally groups of matrices can (and will in these notes) be thought of as 
such. 

The group R* of non-zero real numbers under multiplication is isomorphic to 
GL(1,R). Thus we will regard R* as a matrix Lie group. Similarly, the group C* 
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of non-zero complex numbers under multiplication is isomorphic to GL(1;C), and 
the group S 1 of complex numbers with absolute value one is isomorphic to U(l). 

The group R under addition is isomorphic to GL(1; R)+ (lxl real matrices with 
positive determinant) via the map x — > [e x ]. The group R" (with vector addition) 
is isomorphic to the group of diagonal real matrices with positive diagonal entries, 
via the map 

/ e Xl 
\ e x " 

2.10. The Euclidean and Poincare groups. The Euclidean group E(n) 
is by definition the group of all one-to-one, onto, distance-preserving maps of R™ 
to itself, that is, maps / : R™ — > R™ such that d(f(x),f(y)) — d(x,y) for all 
x,y G R". Here d is the usual distance on R™, d(x,y) = \x — y\ . Note that we 
don't assume anything about the structure of / besides the above properties. In 
particular, / need not be linear. The orthogonal group 0(n) is a subgroup of E(n), 
and is the group of all linear distance-preserving maps of R™ to itself. The set of 
translations of 1" (i.e., the set of maps of the form T x {y) = x + y) is also a subgroup 
ofE(n). 

Proposition 2.3. Every element T of E(n) can be written uniquely as an or- 
thogonal linear transformation followed by a translation, that is, in the form 

T = T X R 

with x e R™, and R e O(n). 

We will not prove this here. The key step is to prove that every one-to-one, 
onto, distance-preserving map of R™ to itself which fixes the origin must be linear. 

Following Miller, we will write an element T = T X R of E(n) as a pair {x, R}. 
Note that for y e R™, 

{x, R}y = Ry + x 

and that 

{xi, Ri}{x 2 , R 2 }y = Ri(R2y + x 2 ) + x\ = RiR 2 y + {x\ + Rix 2 ) 
Thus the product operation for E(n) is the following: 

(2.5) { Xl , R!}{x 2 , R 2 } = {x! + R lX2 ,RiR 2 } 
The inverse of an element of E(n) is given by 

{x^y 1 = {-R^x^R- 1 } 

Now, as already noted, E(n) is not a subgroup of GL(n;R), since translations 
are not linear maps. However, E(n) is isomorphic to a subgroup of GL(n + 1;R), 
via the map which associates to {x, R} € E(n) the following matrix 

/ *i \ 

(2.6) R '■ 

\ o ••• o 1 J 

This map is clearly one-to-one, and it is a simple computation to show that it is a 
homomorphism. Thus E(n) is isomorphic to the group of all matrices of the form 
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( p^ ) (with R £ O(n)). The limit of things of the form ( |2.6| ) is again of that form, 
and so we have expressed the Euclidean group E(n) as a matrix Lie group. 

We similarly define the Poincare group P(n; 1) to be the group of all transfor- 
mations of M n+1 of the form 



T = T T A 



with x £ R n+1 , A £ 0(n; 1). This is the group of affine transformations of K. n+1 
which preserve the Lorentz "distance" dx,(x, y) — (x\ — yi) 2 + • • • + (x n — y n ) 2 — 
(x n+ i — y n +i) 2 - (An affine transformation is one of the form x — > Ax + 6, where 
A is a linear transformation and b is constant.) The group product is the obvious 
analog of the product (2.5) for the Euclidean group. 

The Poincare group P(n; 1) is isomorphic to the group of (n + 2) x (n + 2) 
matrices of the form 



(2.7) 



Xl \ 



A 



X n +1 

\o ... 1 / 



with A £ 0(n; 1). The set of matrices of the form (2.7) is a matrix Lie group. 



3. Compactness 

Definition 2.4. A matrix Lie group G is said to be compact if the following 
two conditions are satisfied: 

1. If A n is any sequence of matrices in G, and A n converges to a matrix A, 
then A is in G. 

2. There exists a constant C such that for all A £ G, |Ay| < C for all 1 < 
i,j < n. 

This is not the usual topological definition of compactness. However, the set 

2 

of all n x n complex matrices can be thought of as C™ . The above definition says 

2 

that G is compact if it is a closed, bounded subset of C" . It is a standard theorem 
from elementary analysis that a subset of C m is compact (in the usual sense that 
every open cover has a finite subcover) if and only if it is closed and bounded. 

All of our examples of matrix Lie groups except GL(n;R) and GL(n;C) have 
property (|I]). Thus it is the boundedness condition (^|) that is most important. 

The property of compactness has very important implications. For exam- 
ple, if G is compact, then every irreducible unitary representation of G is finite- 
dimensional. 



3.1. Examples of compact groups. The groups 0(n) and SO(n) are com- 
pact. Property (0) is satisfied because the limit of orthogonal matrices is orthogonal 
and the limit of matrices with determinant one has determinant one. Property (||) 
is satisfied because if A is orthogonal, then the column vectors of A have norm one, 
and hence 1^4^! < 1, for all 1 < i,j < n. A similar argument shows that U(n), 
SU(n), and Sp(n) are compact. (This includes the unit circle, S 1 = U(l).) 
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3.2. Examples of non-compact groups. All of the other examples given 
of matrix Lie groups are non-compact. GL(n; R) and GL(n; C) violate property ([!]), 
since a limit of invertible matrices may be non-invertible. SL(n;R) and SL(n;C) 
violate (0), except in the trivial case n = 1, since 

( n \ 

n 

A - 1 

V i / 

has determinant one, no matter how big n is. 

The following groups also violate (0), and hence are non-compact: 0(n; C) and 
SO(n;C); 0(n; k) and SO(n;fc) (n > 1, k > 1); the Heisenberg group H; Sp(n;R) 
and Sp(n;C); E(n) and P(n; 1); R and R"; R* and C*. It is left to the reader to 
provide examples to show that this is the case. 

4. Connectedness 

Definition 2.5. A matrix Lie group G is said to be connected if given any 
two matrices A and B in G, there exists a continuous path A(t), a < t < b, lying 
in G with A(a) = A, and A(b) = B. 

This property is what is called path-connected in topology, which is not (in 
general) the same as connected. However, it is a fact (not particularly obvious at 
the moment) that a matrix Lie group is connected if and only if it is path-connected. 
So in a slight abuse of terminology we shall continue to refer to the above property 
as connectedness. (See Section [z|) 

A matrix Lie group G which is not connected can be decomposed (uniquely) 
as a union of several pieces, called components, such that two elements of the 
same component can be joined by a continuous path, but two elements of different 
components cannot. 

Proposition 2.6. If G is a matrix Lie group, then the component of G con- 
taining the identity is a subgroup ofG. 

Proof. Saying that A and B are both in the component containing the identity 
means that there exist continuous paths A(t) and B{t) with A(0) = B(0) = /, 
A(l) = A, and B(l) = B. But then A(t)B(t) is a continuous path starting at I and 
ending at AB. Thus the product of two elements of the identity component is again 
in the identity component. Furthermore, A(t)^ 1 is a continuous path starting at / 
and ending at A -1 , and so the inverse of any element of the identity component is 
again in the identity component. Thus the identity component is a subgroup. □ 

PROPOSITION 2.7. The group GL(n;C) is connected for all n>l. 

Proof. Consider first the case n = 1. A 1 x 1 invertible complex matrix A is 
of the form A = [A] with A € C*, the set of non-zero complex numbers. But given 
any two non-zero complex numbers, we can easily find a continuous path which 
connects them and does not pass through zero. 

For the case n > 1, we use the Jordan canonical form. Every n x n complex 
matrix A can be written as 

A = CBC' 1 
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where B is the Jordan canonical form. The only property of B we will need is that 
B is upper-triangular: 



B = 



( Ai 



V A n ) 

If A is invcrtible, then all the Aj's must be non-zero, since det A — det B = Ai • • • A„. 

Let B(t) be obtained by multiplying the part of B above the diagonal by (1 — t), 
for < t < 1, and let A(t) = Ci?(t)C _1 . Then A(t) is a continuous path which 
starts at A and ends at CDC" 1 , where D is the diagonal matrix 



D 



( Ai 



V o 



\ 



A„ / 



This path lies in GL(n; C) since det A{t) = \\ ■ ■ ■ X n for all t. 

But now, as in the case n — 1, we can define Xi{t) which connects each to 1 
in C*, as t goes from 1 to 2. Then we can define 



A(t) = C 



( Ai(t) 



V o 







A„(i) 



c- 1 



This is a continuous path which starts at CDC" 1 when t = 1, and ends at / 
(= CIC^ 1 ) when t = 2. Since the Aj(t)'s are always non-zero, A(t) lies in GL(n; C). 

We see, then, that every matrix A in GL(n; C) can be connected to the identity 
by a continuous path lying in GL(n;C). Thus if A and B arc two matrices in 
GL(n; C), they can be connected by connecting each of them to the identity. □ 



Proposition 2.8. The group SL(n;C) is connected for all n > 1. 

Proof. The proof is almost the same as for GL(n;C), except that we must 
be careful to preserve the condition det A = 1 . Let A be an arbitrary element of 
SL (n; C). The case n = 1 is trivial, so we assume n>2. We can define A(t) as above 
forO < t < 1, with A(0) = A, and A(l) = CDC- 1 , since dot A(t) = det A = 1. Now 
define Xi(t) as before for 1 < i < n— 1, and define X n (t) to be [Xi(t) ■ ■ ■ A„_i(t)] . 
(Note that since Ai • • • A„ = 1, A„(0) = A„.) This allows us to connect A to the 
identity while staying within SL (n; C). □ 



Proposition 2.9. The groups U(n) and SU(n) are connected, for all n> 1. 



PROOF. By a standard result of linear algebra, every unitary matrix has an 
orthonormal basis of eigenvectors, with eigenvalues of the form e 10 . It follows that 
every unitary matrix U can be written as 



/ e 1 ' 



\ 



(2.8) 



U = Ui 
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with Ui unitary and 9i € R. Conversely, as is easily checked, every matrix of the 



form (2.8) is unitary. Now define 

/ e *(i-*)0i 

u(t) = uA ■.. | ' 

y o e i(i-*)«» 

As t ranges from to 1, this defines a continuous path in U(n) joining U to /. This 
shows that U(n) is connected. 



A slight modification of this argument, as in the proof of Proposition 2.8, shows 



that SU(n) is connected. □ 

Proposition 2.10. The group GL(n;R) is not connected, but has two compo- 
nents. These are GL(n;R) + , the set ofnxn real matrices with positive determinant, 
and GL(n;R) _ , the set of n x n real matrices with negative determinant. 

Proof. GL(n; R) cannot be connected, for if det A > and det B < 0, then any 
continuous path connecting A to B would have to include a matrix with determinant 
zero, and hence pass outside of GL(n;R). 

The proof that GL(n; R) + is connected is given in Exercise 0. Once GL(n;R)+ 
is known to be connected, it is not difficult to see that GL(n; R)~ is also connected. 
For let C be any matrix with negative determinant, and take A,B in GL(n;R) _ . 
Then C _1 A and C~ X B are in GL(n;R) + , and can be joined by a continuous path 
D(t) in GL(n;R) + . But then CD(t) is a continuous path joining A and B in 
GL(n;R) _ . □ 

The following table lists some matrix Lie groups, indicates whether or not the 
group is connected, and gives the number of components. 



Group 


Connected? 


Compo 


GL(n;C) 


yes 


1 


SL(n;C) 


yes 


1 


GL(n;R) 


no 


2 


SL(n;R) 


yes 


f 


O(n) 


no 


2 


SO(n) 


yes 


f 


U(n) 


yes 


f 


SU(n) 


yes 


f 


0(n;l) 


no 


4 


SO(n; 1) 


no 


2 


Heisenberg 


yes 


f 


E(n) 


no 


2 


P(n;l) 


no 


4 



Proofs of some of these results are given in Exercises |, [Tl], and [u|. (The 
connectedness of the Heisenberg group is immediate.) 

5. Simple-connectedness 

Definition 2.11. A connected matrix Lie group G is said to be simply con- 
nected if every loop in G can be shrunk continuously to a point in G. 

More precisely, G is simply connected if given any continuous path A{t), < 
t < 1; lying in G with A(0) = A{\), there exists a continuous function A{s,t), 
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< s,t < 1, taking values in G with the following properties: 1) A(s,0) = A(s, 1) 
for all s, 2) A(0, t) = A{t), and 3) A(l, t) = A{1, 0) for all t. 

You should think of A(t) as a loop, and A(s, t) as a parameterized family of 
loops which shrinks A(t) to a point. Condition 1) says that for each value of the 
parameter s, we have a loop; condition 2) says that when s = the loop is the 
specified loop A(t); and condition 3) says that when s = 1 our loop is a point. 

It is customary to speak of simple-connectedness only for connected matrix Lie 
groups, even though the definition makes sense for disconnected groups. 

Proposition 2.12. The group SU(2) is simply connected. 

Proof. Exercise || shows that SU(2) may be thought of (topologically) as the 
three-dimensional sphere S 3 sitting inside R 4 . It is well-known that S 3 is simply 
connected. □ 

The condition of simple-connectedness is extremely important. One of our most 
important theorems will be that if G is simply connected, then there is a natural 
one-to-one correspondence between the representations of G and the representations 
of its Lie algebra. 

Without proof, we give the following table. 

Group Simply connected? 

GL(n; C) no 

SL (n; C) yes 

GL(n; R) no 

SL(n;R) no 

SO(n) no 

U(n) no 

SU(ro) yes 

SO(l;l) yes 

SO(n; 1) (n > 2) no 

Heisenberg yes 



6. Homomorphisms and Isomorphisms 

Definition 2.13. Let G and H be matrix Lie groups. A map 4> from G to H 
is called a Lie group homomorphism if 1) <f> is a group homomorphism and 2) 
4> is continuous. If in addition, (f> is one-to-one and onto, and the inverse map </> _1 
is continuous, then <f> is called a Lie group isomorphism. 

The condition that <j) be continuous should be regarded as a technicality, in 
that it is very difficult to give an example of a group homomorphism between two 
matrix Lie groups which is not continuous. In fact, if G = R and H = C*, then 
any group homomorphism from G to H which is even measurable (a very weak 
condition) must be continuous. (See W. Rudin, Real and Complex Analysis, Chap. 
9, Ex. 17.) 

If G and H are matrix Lie groups, and there exists a Lie group isomorphism 
from G to H, then G and H are said to be isomorphic, and we write G = H . Two 
matrix Lie groups which are isomorphic should be thought of as being essentially 
the same group. (Note that by definition, the inverse of Lie group isomorphism is 
continuous, and so also a Lie group isomorphism.) 
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6.1. Example: SU(2) and S0(3). A very important topic for us will be the 
relationship between the groups SU(2) and SO (3). This example is designed to 
show that SU(2) and SO(3) are almost (but not quite!) isomorphic. Specifically, 
there exists a Lie group homomorphism <f> which maps SU(2) onto SO(3), and which 
is two-to-one. (See Miller 7.1 and Brocker, Chap. I, 6.18.) 

Consider the space V of all 2 x 2 complex matrices which are self-adjoint and 
have trace zero. This is a three-dimensional real vector space with the following 
basis 

M = ( ? I ) ; M = ( -i o ) 5 M = ( o -°i ) 

We may define an inner product on V by the formula 

(A, B) = -trace(AB) 

(Exercise: check that this is an inner product.) 

Direct computation shows that {Ai, A%, A3} is an orthonormal basis for V. 
Having chosen an orthonormal basis for V, we can identify V with R 3 . 

Now, if U is an element of SU(2), and A is an element of V, then it is easy to 
see that UAU^ 1 is in V. Thus for each U <E SU(2), we can define a linear map <j>u 
of V to itself by the formula 

<j>u{A) = UAU- 1 

(This definition would work for U 6 U(2), but we choose to restrict our attention 
to SU(2).) Moreover, given U € SU(2), and A, B g V, note that 

{^u(A),i>u{B)) = ^tr&ceiUAU^UBU- 1 ) = -trace(AB) = (A,B) 

Thus <pu is an orthogonal transformation of V = M 3 , which we can think of as an 
element of 0(3). 

We see, then, that the map U — > 4>u is a map of SU(2) into 0(3). It is very 
easy to check that this map is a homomorphism (i.e., <pUiU 2 = 4 , Ui4 > u 2 )^ an d that 
it is continuous. Thus U — * 4>u is a Lie group homomorphism of SU(2) into 0(3). 

Recall that every element of 0(3) has determinant ±1. Since SU(2) is connected 
(Exercise ||) , and the map U — > </>j/ is continuous, 4>u must actually map into SO (3). 
Thus U — * 4>u is a Lie group homomorphism of SU(2) into S0(3). 

The map ?7 — ^ 0t/ is not one-to-one, since for any U G SU(2), 4>u — <j)-u- 
(Observe that if U is in SU(2), then so is —U.) It is possible to show that 4>u 1S a 
two-to-one map of SU(2) onto S0(3). (See Miller.) 



7. Lie Groups 

A Lie group is something which is simultaneously a group and a differentiable 



manifold (see Definition 2.14). As the terminology suggests, every matrix Lie group 



is a Lie group, although this requires proof (Theorem p,15| ). I have decided to 
restrict attention to matrix Lie groups, except in emergencies, for three reasons. 
First, this makes the course accessible to students who are not familiar with the 
theory of differentiable manifolds. Second, this makes the definition of the Lie 
algebra and of the exponential mapping far more comprehensible. Third, all of the 
important examples of Lie groups are (or can easily be represented as) matrix Lie 
groups. 
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Alas, there is a price to pay for this simplification. Certain important topics 
(notably, the universal cover) are considerably complicated by restricting to the 
matrix case. Nevertheless, I feel that the advantages outweigh the disadvantages in 
an introductory course such as this. 

Definition 2.14. A Lie group is a differentiable manifold G which is also a 
group, and such that the group product 

GxG->G 

and the inverse map g — > g^ 1 are differentiable. 

For the reader who is not familiar with the notion of a differentiable manifold, 
here is a brief recap. (I will consider only manifolds embedded in some R™, which is a 
harmless assumption.) A subset M of R n is called a /c-dimensional differentiable 
manifold if given any mo € M, there exists a smooth (non-linear) coordinate 
system (x 1 , ■ ■ ■ x n ) defined in a neighborhood U of to such that 

Mfl U = {to € U |a; fe+1 (TO) = Ci, • • • ,x n (m) = c„_ fc } 

This says that locally, after a suitable change of variables, M looks like the k- 
dimensional hyperplanc in R™ obtained by setting all but the first k coordinates 
equal to constants. 

For example, S 1 C R 2 is a one-dimensional differentiable manifold because in 
the usual polar coordinates (0,r), S 1 is the set r = 1. Of course, polar coordinates 
arc not globally defined, because 6 is undefined at the origin, and because is not 
"single- valued." But given any point too in S 1 , we can define polar coordinates in 
a neighborhood U of mo, and then S 1 n U will be the set r = 1. 

Note that while we assume that our differentiable manifolds are embedded in 
some R™ (a harmless assumption), we are not saying that a Lie group has to be 

2 

embedded in R" , or that the group operation has to have anything to do with 
matrix multiplication. A Lie group is simply a subset G of some R™ which is a 
differentiable manifold, together with any map from G x G into G which makes 
G into a group (and such that the group operations are smooth). It is remarkable 
that almost (but not quite!) every Lie group is isomorphic to a matrix Lie group. 

Note also that it is far from obvious that a matrix Lie group must be a Lie 
group, since our definition of a matrix Lie group G does not say anything about G 
being a manifold. It is not too difficult to verify that all of our examples of matrix 
Lie groups are Lie groups, but in fact we have the following result which makes 
such verifications unnecessary: 

Theorem 2.15. Every matrix Lie group is a Lie group. 

Although I will not prove this result, I want to discuss what would be involved. 
Let us consider first the group GL(n;R). The space of all n x n real matrices can 
be thought of as R n . Since GL(n;R) is the set of all matrices A with det^4 ^ 0, 
GL(n;R) is an open subset of M™ . (That is, given an invertible matrix A, there 
is a neighborhood U of A such that every matrix B £ U is also invertible.) Thus 
GL(n;R) is an n 2 -dimensional smooth manifold. Furthermore, the matrix product 
AB is clearly a smooth (even polynomial) function of the entries of A and J5, and 
(in light of Kramer's rule) A^ 1 is a smooth function of the entries of A. Thus 
GL(n;R) is a Lie group. 
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Similarly, if we think of the space ofnxn complex matrices as C ra = R 2n , 
then the same argument shows that GL(n; C) is a Lie group. 

Thus, to prove that every matrix Lie group is a Lie group, it suffices to show 
that a closed subgroup of a Lie group is a Lie group. This is proved in Brocker and 
torn Dieck, Chapter I, Theorem 3.11. The proof is not too difficult, but it requires 
the exponential mapping, which we have not yet introduced. (See Chapter 3.) 

It is customary to call a map <j) between two Lie groups a Lie group homomor- 
phism if is a group homomorphism and 4> is smooth, whereas we have (in Definition 



2.13) required only that (f> be continuous. However, the following Proposition shows 



that our definition is equivalent to the more standard one. 

Proposition 2.16. Let G and H be Lie groups, and cj) a group homomorphism 
from G to H. Then if (p is continuous it is also smooth. 

Thus group homomorphisms from G to H come in only two varieties: the very 
bad ones (discontinuous), and the very good ones (smooth). There simply aren't 
any intermediate ones. (See, for example, Exercise |l6|. ) For proof, see Brocker and 
torn Dieck, Chapter I, Proposition 3.12. 



In light of Theorem 2.15, every matrix Lie group is a (smooth) manifold. As 
such, a matrix Lie group is automatically locally path connected. It follows that 
a matrix Lie group is path connected if and only if it is connected. (See Remarks 



following Definition 2.5.) 



8. Exercises 

1. Let a be an irrational real number. Show that the set of numbers of the 
form e 2nma j n e Z, is dense in S 1 . Now let G be the following subgroup of 
GL(2;C): 

G ={( 6 o e -)'* GR } 

Show that 

S ={(o £ )l*'* eR }' 

where G denotes the closure of the set G inside the space of 2 x 2 matrices. 

Note: The group G can be thought of as the torus S 1 x S 1 , which in 
turn can be thought of as [0, 2tt] x [0, 2ir], with the ends of the intervals 
identified. The set G C [0, 27r] x [0, 2ir] is called an irrational line. Draw 
a picture of this set and you should see why G is dense in [0, 2tt] x [0, 2w\. 

2. Orthogonal groups. Let ( ) denote the standard inner product on R™, (x, y) — 

Xiy.i. Show that a matrix A preserves inner products if and only if the 
column vectors of A are orthonormal. 

Show that for any n x n real matrix B, 

(Bx,y) = (x,B tr y) 

where (-B )y — Bji. Using this fact, show that a matrix A preserves inner 
products if and only if A tr A = I. 

Note: a similar analysis applies to the complex orthogonal groups 0(n; C) 
and SO(rt; C). 
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3. Unitary groups. Let ( ) denote the standard inner product on C n , (x, y) = 
J2i ~iVi- Following Exercise |[ show that A* A = I if and only if (Ax, Ay) = 
(x,y) for all x,y E C n . ((A*) tj = A~.) 

4. Generalized orthogonal groups. Let [x, y] n k be the symmetric bilinear form 
on W l+k defined in ( 2A ) . Let g be the (n + k) x (n + k) diagonal matrix 
with first n diagonal entries equal to one, and last k diagonal entries equal 
to minus one: 

In 
-I k 

Show that for all x, y € R" +fc , 

l x >y] n ,k = ( x ^y) 

Show that a (n + k) x (n + k) real matrix A is in 0(n; k) if and only if 
A tr gA — g. Show that 0(n; k) and SO(n; k) are subgroups of GL(n + k; M), 
and are matrix Lie groups. 

5. Symplectic groups. Let B [x, y] be the skew-symmetric bilinear form on R 2 ™ 
given by B [x, y] — x iVn+i — x n +iyi. Let J be the 2n x 2rt matrix 



J = 

Show that for all x, y € R 2 ™ 



J 
-7 



Show that a 2n x 2n matrix A is in Sp (n; R) if and only if A tr JA — J. Show 
that Sp (n; R) is a subgroup of GL(2n; R), and a matrix Lie group. 
Note: a similar analysis applies to Sp (n; C). 
6. T/ie groups 0(2) anc? S0(2). Show that the matrix 



A = 



cos ^ — sm u 
sin 6* cos 



is in SO (2), and that 



cos ft — sin ft \ / cos0 —smq> \ _l cos(ft + 0j — sm(ft 
sin 9 cos 9 J \ sin cos ^ J \ sin(8 + 0) cos(9 - 

Show that every element A of 0(2) is of one of the two forms 

, . cos 9 — sin ft 
sin 9 cos 9 



A = 



cos 9 sin (9 
sin — cos 6 



(If ^4 is of the first form, then detA = 1; if A is of the second form, then 
detA = -1.) 

Hint: Recall that for A = ( a ^ J to be in 0(2), the column vectors 



and ( j ) must be unit vectors, and must be orthogonal. 
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7. The groups 0(1; 1) and S0(1;1). Show that 

.4 = 



cosh t sinh t 
sinh t cosh t 



is in S0(1; 1), and that 



coshi sinhi \ / coshs sinhs \ / cosh(f + s) sinh(i + s) 
sinhi cosht J \ sinhs coshs J I sinh(i + s) cosh(i + s) 

Show that every element of 0(1; 1) can be written in one of the four forms 

coshi sinht 
sinh t cosh t 



- cosh t sinh t 
sinhi — coshi 

cosh t — sinh t 
sinh£ — coshi 

-cosht — sinht 
sinh t cosh t 

(Since cosht is always positive, there is no overlap among the four cases. 
Matrices of the first two forms have determinant one; matrices of the last 
two forms have determinant minus one.) 

Hint: For ( ° ^ to be in 0(1; 1), we must have a 2 — c 2 = 1, b 2 — d 2 = 

— 1, and ab — cd — 0. The set of points (a, c) in the plane with a 2 — c 2 = 1 
(i.e., a = ivl + c 2 ) is a hyperbola. 

The group SU(2). Show that if a, (3 are arbitrary complex numbers satisfying 
lad 2 + \B\ 2 = 1, then the matrix 



.9) A 



a —(3 
(3 a 



is in SU(2). Show that every A £ SU(2) can be expressed in the form ( |2.9| ) 
for a unique pair (a,/3) satisfying |a| 2 + |/3| 2 = 1. (Thus SU(2) can be 
thought of as the three-dimensional sphere S 3 sitting inside C 2 = K 4 . In 
particular, this shows that SU(2) is connected and simply connected.) 
9. The groups Sp (1;M), Sp (1; C), and Sp (1). Show that Sp (1;K) = SL (2;M), 
Sp (1; C) = SL (2; C), and Sp(l) = SU(2). 

10. The Heisenberg group. Determine the center Z(H) of the Heisenberg group 
H . Show that the quotient group H/Z(H) is abelian. 

11. Connectedness of SO(n). Show that SO(n) is connected, following the out- 
line below. 

For the case n = 1, there is not much to show, since a 1 x 1 matrix with 
determinant one must be [1]. Assume, then, that n > 2. Let e\ denote the 
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vector 



( 1 \ 





ei 



V o J 

in R™. Given any unit vector v E R", show that there exists a continuous 
path R(t) in SO(n) with i?(0) = / and = e\. (Thus any unit vector 

can be "continuously rotated" to ei.) 

Now show that any element R of SO(n) can be connected to an element 
of SO(n — 1), and proceed by induction. 

12. The polar decomposition of SL (n; R). Show that every element A of SL (n; R) 
can be written uniquely in the form A — RH, where R is in SO(n), and H 
is a symmetric, positive-definite matrix with determinant one. (That is, 
H tr = H, and (x, Hx) > for all x € R"). 

Hint: If A could be written in this form, then we would have 

A tr A = H tr R tr RH = HR^RH = H 2 

Thus H would have to be the unique positive-definite symmetric square root 
of A tr A. 

Note: A similar argument gives polar decompositions for GL(n;R), 
SL(rt;C), and GL(n;C). For example, every element A of SL(n;C) can 
be written uniquely as A = UH, with U in SU(n), and H a self-adjoint 
positive-definite matrix with determinant one. 

13. The connectedness of SL (rt;R). Using the polar decomposition of SL (n;R) 
(Ex. |l2|) and the connectedness of SO(n) (Ex. |ll|), show that SL(rt;R) is 
connected. 

Hint: Recall that if H is a real, symmetric matrix, then there exists a 
real orthogonal matrix R\ such that H — RxDR{ , where D is diagonal. 

14. The connectedness of GL(n;R) + . Show that GL(n;R) + is connected. 

15. Show that the set of translations is a normal subgroup of the Euclidean 
group, and also of the Poincare group. Show that (E(n)/translations) = 
0(n). 

16. Harder. Show that every Lie group homomorphism <ft from R to 5* 1 is of the 
form 4>(x) = e lax for some a € R. In particular, every such homomorphism 
is smooth. 
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Lie Algebras and the Exponential Mapping 



1. The Matrix Exponential 

The exponential of a matrix plays a crucial role in the theory of Lie groups. 
The exponential enters into the definition of the Lie algebra of a matrix Lie group 
(Section H below), and is the mechanism for passing information from the Lie alge- 
bra to the Lie group. Since many computations are done much more easily at the 
level of the Lie algebra, the exponential is indispensable. 

Let X be an n x n real or complex matrix. We wish to define the exponential 
of X, e x or expX, by the usual power series 

(3.1) e x = V 

y ' ^ m! 

ra=0 

We will follow the convention of using letters such as X and Y for the variable in 
the matrix exponential. 



Proposition 3.1. For any n x n real or complex matrix X, the series (3.1) 
converges. The matrix exponential e x is a continuous function of X . 

Before proving this, let us review some elementary analysis. Recall that the 
norm of a vector x in C n is defined to be 



||x|| = V ( x i x ) = v5Z 

This norm satisfies the triangle inequality 

\\x + y\\ < \\x\\ + \\y\ 
The norm of a matrix A is defined to be 



1 2 

Xi\ . 



\\A\\ = sup 



\Ax\\ 



Equivalently, \\A\\ is the smallest number A such that || Ax\\ < X \\x\\ for all x £ C". 

It is not hard to see that for any n x n matrix A, \\A\\ is finite. Furthermore, 
it is easy to see that for any matrices A, B 

(3.2) \\AB\\ < \\A\\ \\B\\ 

(3.3) \\A + B\\<\\A\\ + \\B\\. 

It is also easy to see that a sequence of matrices A m converges to a matrix A if and 



only if \\A m — A\\ — > 0. (Compare this with Definition 2.1 of Chapter 2.) 

A sequence of matrices A m is said to be a Cauchy sequence if \\A m — Ai\\ — > 

2 2 

as m, I —>■ oo. Thinking of the space of matrices as R™ or C" , and using a standard 
result from analysis, we have the following: 
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Proposition 3.2. If A m is a sequence of n x n real or complex matrices, and 
A m is a Cauchy sequence, then there exists a unique matrix A such that A m con- 
verges to A. 

That is, every Cauchy sequence converges. 

Now, consider an infinite series whose terms are matrices: 



(3.4) 
If 



A + Ax + A 2 



\\Am\\ < OO 

m=0 

then the series ([O]) is said to converge absolutely. If a series converges abso- 
lutely, then it is not hard to show th at th e partial sums of the series form a Cauchy 
sequence, and hence by Proposition 3.2, the series converges. That is, any series 
which converges absolutely also converges. (The converse is not true; a series of 
matrices can converge without converging absolutely.) 



PROOF. In light of (|3.2j), we see that 

HX m || < IIXI 



and hence 



oo 

E 

m=0 



X' 



< 



E- 

m=0 



1*1 



IIA'II 



< OO. 



Thus the series ([O]) converges absolutely, and so it converges. 

To show continuity, note that since X m is a continuous function of X, the 
partial sums of ( |3.ip are continuous. But it is easy to see that ( |3.1[ ) converges 
uniformly on each set of the form {||X|| < R}, and so the sum is again continuous. 

□ 



Proposition 3.3. Let X,Y be arbitrary n x n matrices. Then 

1. e° = /. 

2. e x is invertible, and (e x ) = e~ x . 

3. e^ a+ ^ x — e aX e^ x for all real or complex numbers a, (3. 



4. If XY = YX, then e x+Y = e x e Y = e Y e x . 

5. IfC is invertible, then e cxcl = Ce x C~ x . 

6. ||e*|| <e"*ll. 

It is not true in general that e x+Y = e x e Y , although by |]) it is true if X and 
Y commute. This is a crucial point, which we will consider in detail later. (See 
the Lie product formula in Section [| and the Baker-Campbcll-Hausdorff formula in 
Chapter 4.) 

PROOF. Point 0) is obvious. Points ||) and ||) are special cases of point ^). To 
verify point ^), we simply multiply power series term by term. (It is left to the 
reader to verify that this is legal.) Thus 

,-V = f/ + A- + f! + ..-)(/ + r + £ 
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Multiplying this out and collecting terms where the power of X plus the power of 
Y equals m, we get 



(3.5) 



e x e Y 



oc m 



EE 



T — V 



k\ (m — fc)! 



k\rm — k 



ml ' k\(m — k)\ 



X ky 



m=0 k=0 

Now because (and only because) X and Y commute, 

m! 



(x+Yr = Yl 



k^rra — k 



k=0 



k\(m — k)\ 



X Y 



and so (|3.5| ) becomes 



e x e Y 



y ±( X + Y) m = e x+Y . 



m— 



To prove ^), simply note that 

(cxc- 1 )'" = CX m C~ l 

and so the two sides of ^|) are the same term by term. 



Point ra) is evident from the proof of Proposition 3.1 



□ 



Proposition 3.4. Let X be a n x n complex matrix, and view the space of all 
n x n complex matrices as C . Then e is a smooth curve in C" , and 

d 



dt 



Xe 



e tx X. 



In particular, 



d 
di 



e tx =X. 



t=o 



Proof. Differentiate the power series for e tx term-by-term. (You might worry 
whether this is valid, but you shouldn't. For each (e tx )^. is given by a con- 
vergent power series in t, and it is a standard theorem that you can differentiate 
power series term- by-term.) □ 

2. Computing the Exponential of a Matrix 

2.1. Case 1: X is diagonalizable. Suppose that X is a n x n real or complex 
matrix, and that X is diagonalizable over C, that is, that there exists an invertible 
complex matrix C such that X = CDC^ 1 , with 



Ai 







D 



A n 

Observe that e D is the diagonal matrix with eigenvalues e Al , • • • ,e A ™, and so in 
light of Proposition 3.3, we have 



/ e> 







A" 



c 



cr 



:!() 
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Thus if you can explicitly diagonalize X, you can explicitly compute e x . Note that 
if X is real, then although C may be complex and the Aj's may be complex, e x 
must come out to be real, since each term in the series (3.1) is real. 
For example, take 

' -a 
a 



X = 



Then the eigenvectors of X are 



and 



, with eigenvalues — ia and ia, 



respectively. Thus the invertible matrix 



C 



1 % 
i 1 



maps the basis vectors 



and 







to the eigenvectors of X, and so (check) 



1 

J y 1 

C- l XC is a diagonal matrix D. Thus X = CDC- 1 : 

1 i \ f e-' la A f 1/2 -i/2 



„x 



i 1 







-i/2 1/2 



cos a — sin a 
sin a cos a 



Note that explicitly if X (and hence a) is real, then e is real. 

2.2. Case 2: X is nilpotent. An n x n matrix X is said to be nilpotent 

if X m = for some positive integer to. Of course, if X m = 0, then X 1 = for all 
I > m. In this case the series (3.1) which defines e x terminates after the first m 
terms, and so can be computed explicitly. 
For example, compute e tx , where 



X 




Note that 



and that X 3 = 0. Thus 



ac 
X 2 = | 




( I 



to 
1 

V o o 



£6 + -i 2 ac 

tc 
1 



2.3. Case 3: X arbitrary. A general matrix X may be neither nilpotent nor 
diagonalizable. However, it follows from the Jordan canonical form that X can be 
written (Exercise ||) in the form X = S + N with £ diagonalizable, N nilpotent, 
and SN — NS. (See Exercise 0.) Then, since N and S commute, 



^S+N 



e s e N 



and e and e can be computed as above. 
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X 



a b 
a 



For example, take 
Then 

X = 

The two terms clearly commute (since the first one is a multiple of the identity), 
and so 

' e a \ / 1 b \ _ ( e a e a b 
e a I { 1 j ~ { e a 












; 


a . 




2) 



e X = 



3. The Matrix Logarithm 

We wish to define a matrix logarithm, which should be an inverse function to 
the matrix exponential. Defining a logarithm for matrices should be at least as 
difficult as defining a logarithm for complex numbers, and so we cannot hope to 
define the matrix logarithm for all matrices, or even for all invertible matrices. We 
will content ourselves with defining the logarithm in a neighborhood of the identity 
matrix. 

The simplest way to define the matrix logarithm is by a power series. We recall 
the situation for complex numbers: 



Lemma 3.5. The function 

io gz = ± (-ir +l ^ 



m 

m—l 



is defined and analytic in a circle of radius one about z = 1 . 
For all z with \z — 1| < 1, 



e logz = z. 



For all u with \u\ < log 2, \e u — 1| < 1 and 

loge" = u. 

PROOF. The usual logarithm for real, positive numbers satisfies 
±log(l- X ) = T ±- = -(l + x + x 2 + ---) 
for \x\ < 1. Integrating term- by-term and noting that logl = gives 

log(l -a;) = -(a;+^ + ^ + -- -). 
Taking z = 1 — x (so that x = 1 — z), we have 

log* = -((l-*) + ^ + ^ + --- 



m 

m—l 



This series has radius of convergence one, and defines a complex analytic func- 
tion on the set {\z — 1| < 1}, which coincides with the usual logarithm for real z 
in the interval (0,2). Now, exp(logz) = z for z e (0,2), and by analyticity this 
identity continues to hold on the whole set {\z — 1| < 1}. 
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On the other hand, if |u| < log 2, then 

|2 

1 = 



<H + LL 



so that 

|e u -l| < e M -1< 1. 

Thus log(expu) makes sense for all such u. Since log(expu) — u for real u with 
|u| < log 2, it follows by analyticity that log(cxpw) = u for all complex numbers 
with \u\ < log 2. □ 

Theorem 3.6. The function 

.m+l (A ~ ^) ? " 



(3.6) logA=£(-l) 



m— 1 

is defined and continuous on the set of all nxn complex matrices A with \\A — I\\ < 
1. and log A is real if A is real. 
For all A with \\A - I\\ < 1, 



e 

x 



log A _ 



For all X with \\X\\ < log 2, ||e A - 1 < 1 and 

loge x = X. 

PROOF. It is easy to see that the series ([II]) converges absolutely whenever 
||A — J|| < 1. The proof of continuity is essentially the same as for the exponential. 



If A is real, then every term in the series (3.6) is real, and so log A is real. 

We will now show that exp(logA) = A for all A with \\A — I\\ < 1. We do this 
by considering two cases. 

Case 1: A is diagonalizable. 

Suppose that A = CDC" 1 , with D diagonal. Then A - I = CDC'- 1 - I = 
C{D - 7)C _1 . It follows that (A - I) m is of the form 

/ {zi ~l) m \ 

(A-i) m = c\ ■.. cr 1 , 

V o {z n - i) m J 

where zi, • • ■ , z n are the eigenvalues of A. 

Now, if \\A — I\\ < 1, then certainly \zi — 1| < 1 for i = 1, ■ ■ ■ ,n. (Think about 
it.) Thus 

logzi 



log z n 



and so by the Lemma 



glog Zi 



' ■ C | •.. | r ' A. 

e logz " 



Case 2: A is not diagonalizable. 
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If A is not diagonalizable, then, using the Jordan canonical form, it is not 
difficult to construct a sequence A m of diagonalizable matrices with A m — > A. (See 
Exercise^.) If \\A — I\\ < 1, then \\A m — I\\ < 1 for all sufficiently large m. By Case 
1, exp(log A m ) — A m , and so by the continuity of exp and log, exp(logA) = A. 

Thus we have shown that exp(log A) = A for all A with \\A — I\\ < 1. Now, the 
same argument as in the complex case shows that if \\X\\ < log 2, then ||e x — / || < 
1. But then the same two-case argument as above shows that log(expX) = X for 
all such X. □ 



Proposition 3.7. There exists a constant c such that for all nx n matrices B 



with \\B\\ < | 



Proof. Note that 



\log(I + B) - B\\ <c||B|| 2 



oo om °° Dm— 2 

io g (j + b) - b = y (-i) m — = b 2 y (-i) m — 

m—2 ra—1 

so that 

oo (l\ m 

2 



llog^ + ^-iJll^llBll 2 ^ ^ ! 



m 

m=2 

This is what we want. □ 



Proposition 3.8. LetX be anynxn complex matrix, and let C m be a sequence 
of matrices such that \\C m \\ < con l t ' ■ Then 



lim 



/ + — + C„ 

m 



Proof. The expression inside the brackets is clearly tending to / as m —> oo, 
and so is in the domain of the logarithm for all sufficiently large m. Now 



X 



log [I + — + C m = — + C, 



X 



where E m is an error term which, by Proposition 3.7 satisfies kE" m || < c 



Cn 



< 



£2H|t,. But then 



and so 



I + — + G n 

m 



— + C n 

771 



exp 



X 



exp (X + mC m + mE m ) 



Since both C m and E m are of order -jj, we obtain the desired result by letting 
m — ► oo and using the continuity of the exponential. □ 
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4. Further Properties of the Matrix Exponential 

In this section we give three additional results involving the exponential of a 
matrix, which will be important in our study of Lie algebras. 

Theorem 3.9 (Lie Product Formula). Let X andY benxn complex matrices. 
Then 



e 



x+Y 



( 2L i\ m 

hm le m e m J 



This theorem has a big brother, called the Trotter product formula, which gives 
the same result in the case where X and Y are suitable unbounded operators on an 
infinite-dimensional Hilbert space. The Trotter formula is described, for example, 
in M. Reed and B. Simon, Methods of Modern Mathematical Physics, Vol. I, VIII. 8. 

Proof. Using the power series for the exponential and multiplying, we get 

x. x . X Y „ 

e™ e m = 1 H 1 h C m , 

m m 

where (check!) ||C m || < co m t t ' ■ Since e^e™ — > I as m — ► oo, e^e™ is in the 
domain of the logarithm for all sufficiently large m. But 



/ X Y \ ( X Y 

log (e«e» = log (1+ — + — + C„ 
V / \ m m 

X Y 



~H Cm V En 

m m 



where by Proposition p. 7| ||C m || < const. ^ + — + C m \\ < co "| • Exponentiating 
the logarithm gives 



X Y 



and 



:x y 

e m e m = exp 1 h C m + E„ 

m m 



( X_ Y_ \ m 

le m e m ) = exp (V + Y + mC m + rnE m ) 



Since both C m and £" m are of order -\, we have (using the continuity of the 
exponential) 

( x Y \ m 
e - e - =cxp(V + V) 

which is the Lie product formula. □ 
Theorem 3.10. Let X be an n x n real or complex matrix. Then 



det (e x ) 



X\ _ e trace(X) 



Proof. There are three cases, as in Section g. 

Case 1: A is diagonalizable. Suppose there is a complex invertible matrix C 
such that 



X = C 



Ai 
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Then 



3 Ai 



= c 



cr 



Thustrace(X) = £Aj, and det(e x ) = l\e x > = A \ (Recall that trace^CC -1 ) = 
trace(D).) 

Case 2: X is nilpotent. If X is nilpotent, then it cannot have any non-zero 
eigenvalues (check!), and so all the roots of the characteristic polynomial must be 
zero. Thus the Jordan canonical form of X will be strictly upper triangular. That 
is, X can be written as 



X = C 



( o 



V o 



c-\ 







In that case (it is easy to see) e will be upper triangular, with ones on the diagonal: 



C 



I 1 



V o 



Thus if X is nilpotent, trace(X) = 0, and det(e x ) = 1. 

Case 3: X arbitrary. As pointed out in Section pi every matrix X can be 
written as the sum of two commuting matrices S and N, with S diagonalizable 
(over C) and N nilpotent. Since S and N commute, e x = e e N . So by the two 
previous cases 

det (e x ) = det (e s ) det (e*) = e tracc ( s > e trace W = e tTac < x \ 
which is what we want. □ 



Definition 3.11. A function A : K — > GL(n;C) is called a one-parameter 
group if 

1. A is continuous, 

2. A(0) - I, 

3. A(t + s) = A(t)A(s) for all t,s£R. 



Theorem 3.12 (One-parameter Subgroups). If A is a one-parameter group in 
GL(n;C) ; then there exists a unique nx n complex matrix X such that 



IX 



A(t) = e 

By taking n = 1, and noting that GL(1;C) = C*, this Theorem provides an 
alternative method of solving Exercise |l6| in Chapter 2. 

Proof. The uniqueness is immediate, since if there is such an X, then X = 
^| t _ Q j4(i). So we need only worry about existence. 

The first step is to show that A(t) must be smooth. This follows from Proposi- 
tion [2.16 in Chapter 2 (which we did not prove), but we give a self-contained proof. 
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Let f(s) be a smooth real-valued function supported in a small neighborhood of 
zero, with f(s) > and J f(s)ds — 1. Now look at 



(3.7) B(t) = J A(t + s)f(s)ds. 

Making the change-of-variable u = t + s gives 

B(t) = { A(u)f(u-t)du. 



It follows that Bit) is differentiable, since derivatives in the t variable go onto /, 
which is smooth. 

On the other hand, if we use the identity A(t + s) = A(t)A(s) in ( |3.7D , we have 

B(t)=A(t) J A(s)f(s)ds. 

Now, the conditions on the function /, together with the continuity of A, guarantee 
that J A(s)f(s) ds is close to A(0) — I, and hence is invertible. Thus we may write 

(3.8) A(t) = B{t) [ I A(s)f(s)ds ' 



Since B (t) is smooth and J A(s)f(s)ds is just a constant matrix, this shows that 
A (t) is smooth. 

Now that A(t) is known to be differentiable, we may define 

Our goal is to show that A(t) — e tx . Since A(t) is smooth, a standard calculus 
result (extended trivially to handle matrix-valued functions) says 

\\A(t) - (I + tX)\\ < const.i 2 . 
It follows that for each fixed t, 

A (—) =I+±X + 0(-^). 
Then, since A is a one-parameter group 

Mt) = [A(±)r = [i+±x+o(^)r. 



Letting m — > oo and using Proposition 3.8 from Section |3j shows that A(t) = 
e tx . ' □ 

5. The Lie Algebra of a Matrix Lie Group 

The Lie algebra is an indispensable tool in studying matrix Lie groups. On the 
one hand, Lie algebras are simpler than matrix Lie groups, because (as we will see) 
the Lie algebra is a linear space. Thus we can understand much about Lie algebras 
just by doing linear algebra. On the other hand, the Lie algebra of a matrix Lie 
group contains much information about that group. (See for example, Proposition 
3.23 in Section [t], and the Baker-Campbell-Hausdorff Formula (Chapter 4).) Thus 



many questions about matrix Lie groups can be answered by considering a similar 
but easier problem for the Lie algebra. 

Definition 3.13. Let G be a matrix Lie group. Then the Lie algebra of G, 

denoted Q, is the set of all matrices X such that e tx is in G for all real numbers t. 
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Note that even if G is a subgroup of GL(n; C) we do not require that e tx be in 
G for all complex t, but only for all real t. Also, it is definitely not enough to have 
just e x in G. That is, it is easy to give an example of an X and a G such that 
e x E G but e tx ^ G for some values of t. Such an X is not in the Lie algebra of G. 

It is customary to use lower case Gothic (Fraktur) characters such as g and f) 
to refer to Lie algebras. 

5.1. Physicists' Convention. Physicists are accustomed to considering the 
map X — > e lX instead of X — > e x . Thus a physicist would think of the Lie 
algebra of G as the set of all matrices X such that e ltx € G for all real t. In 
the physics literature, the Lie algebra is frequently referred to as the space of 
"infinitesimal group elements." See Brocker and torn Dieck, Chapter I, 2.21. The 
physics literature does not always distinguish clearly between a matrix Lie group 
and its Lie algebra. 

Before examining general properties of the Lie algebra, let us compute the Lie 
algebras of the matrix Lie groups introduced in the previous chapter. 

5.2. T he g eneral linear groups. If X is any nx n complex matrix, then by 
Proposition |3.3| , e tx is invertible. Thus the Lie algebra of GL(n;C) is the space of 
all n x n complex matrices. This Lie algebra is denoted gl(n; C). 

If X is any nx n real matrix, then e tx will be invertible and real. On the other 
hand, if e tx is real for all real t, then X = 4i L „ e tx will also be real. Thus the 

dt\t=0 

Lie algebra of GL(n;R) is the space of all n x n real matrices, denoted gl(n;R). 

Note that the preceding argument shows that if G is a subgroup of GL(n;M), 
then the Lie algebra of G must consist entirely of real matrices. We will use this 
fact when appropriate in what follows. 



5.3. The special linear groups. Recall Theorem 3.10: det (e x ) — e traccX . 
Thus if traceX = 0, then det (e tx ) = 1 for all real t. On the other hand, if X is 
any n x n matrix such that det (e tx ) = 1 for all t, then e (*)( tracoA ') = \ f or & \\ i 
This means that (<)(traceX) is an integer multiple of 27ri for all t, which is only 
possible if traced = 0. Thus the Lie algebra of SL (n; C) is the space of all n x n 
complex matrices with trace zero, denoted sl(n; C). 

Similarly, the Lie algebra of SL (n; K) is the space of all n x n real matrices with 
trace zero, denoted si (n;R). 

5.4. The unitary groups. Recall that a matrix U is unitary if and only if 
U* = U^ 1 . Thus e tx is unitary if and only if 

(3.9) (e tx )* = (e^y 1 = e~ tx . 

But by taking adjoints term-by-term, we see that (e tx )* — e tx , and so ( |3.9|) 
becomes 

(3.10) e tx * =e~ tx . 



Clearly, a sufficient condition for ( 3.1 0| ) to hold is that X* = —X. On the other 



hand, if (3.10) holds for all t, then by differentiating at t = 0, we see that X* = —X 
is necessary. 

Thus the Lie algebra of U (n) is the space of all nx n complex matrices X such 
that X* = —X, denoted u(ra). 
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By combining the two previous computations, we see that the Lie algebra of 
SU(n) is the space of all n x n complex matrices X such that X* = —X and 
trace = 0, denoted su(n). 



5.5. The orthogonal groups. The identity component of 0(n) is just SO(n). 



Since (Proposition 3.14) the exponential of a matrix in the Lie algebra is automat- 
ically in the identity component, the Lie algebra of O(n) is the same as the Lie 
algebra of SO(n). 

Now, an n x n real matrix R is orthogonal if and only if R tr — R . So, given 
an n x n real matrix X, e tx is orthogonal if and only if (e tx ) tr = (e tx )~ 1 , or 

(3.11) e txtr = e~ tx . 



Clearly, a sufficient condition for this to hold is that X tr = —X. If fl3.ll ) holds for 
all t, then by differentiating at t — 0, we must have X tr = —X. 

Thus the Lie algebra of 0(n), as well as the Lie algebra of SO(n), is the space of 
all n x n real matrices X with X tr — —X, denoted so(n). Note that the condition 
X tr = —X forces the diagonal entries of X to be zero, and so explicitly the trace 
of X is zero. 

The same argument shows that the Lie algebra of SO(n; C) is the space of n x n 
complex matrices satisfying X tr — —X, denoted so(n;C). This is not the same as 
su(n). 



5.6. The generalized orthogonal groups. A matrix A is in 0(n; k) if and 
only if A tr gA = g, where g is the (n + k) x (n + k) diagonal matrix with the first 
n diagonal entries equal to one, and the last k diagonal entries equal to minus one. 
This condition is equivalent to the condition g~ 1 A tr g = A^ 1 , or, since explicitly 
g~ x = g, gA tr g = A^ 1 . Now, if X is an (n + k) x (n + k) real matrix, then e tx is 
in 0(n; k) if and only if 

tX tr tgX tr g -tX 

ge g — e y y — e 

This condition holds for all real t if and only if gX tr g = —X. Thus the Lie 
algebra of 0(n; k), which is the same as the Lie algebra of SO(n; k), consists of all 
(n + k) x (n + k) real matrices X with gX tr g = —X. This Lie algebra is denoted 
so(n; k). 

(In general, the group SO(n; k) will not be connected, in contrast to the group 
SO(n). The identity component of SO(n; fc), which is also the identity component 
of 0(n;k), is denoted SO(n;k)]. The Lie algebra of SO(n;fc)/ is the same as the 
Lie algebra of SO(n; fc).) 



5.7. The symplectic groups. These are denoted sp(n;R), sp(n;C), and 
sp (n) . The calculation of these Lie algebras is similar to that of the generalized 
orthogonal groups, and I will just record the result here. Let J be the matrix in 
the definition of the symplectic groups. Then sp (n; K) is the space of 2n x 2n real 
matrices X such that JX tr J = X, sp(n; C) is the space of 2n x 2n complex matrices 
satisfying the same condition, and sp (n) =sp(n; C)flu (2n) . 
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5.8. The Heisenberg group. Recall the Heisenberg group H is the group of 
all 3 x 3 real matrices A of the form 



(3.12) 

Recall also that in Section 
the form 

(3.13) 




.4 



Case 2, we computed the exponential of a matrix of 



X 




and saw that e x was in H . On the other hand, if X is any matrix such that e tx is 
of the form ( [3.12 ) , then all of the entries of X = ^ L_ Q e tx which are on or below 
the diagonal must be zero, so that X is of form (3.13). 

Thus the Lie algebra of the Heisenberg group is the space of all 3 x 3 real 
matrices which are strictly upper triangular. 

5.9. The Euclidean and Poincare groups. Recall that the Euclidean group 
E(n) is (or can be thought of as) the group of (n + 1) x (n + 1) real matrices of the 
form 



R 



\ 



1 J 



with R S 0(n). Now if X is an (n + 1) x (n + 1) real matrix such that e is in 
E(n) for all i, then X — g|L_ e tX must be zero along the bottom row: 

/ 2/1 \ 



(3.14) 



X 



Y 



V o 



Our goal, then, is to determine which matrices of the form (3.14) are actually 
in the Lie algebra of the Euclidean group. A simple computation shows that for 
n > 1 



V o 



Y 



yn Y n ~ l y 



\ 



\0 ■ 



where y is the column vector with entries yi, ■ ■ ■ ,y n - It follows that if X is as in 
( |3.14 ), then e tx is of the form 

/ 



J-X 



AY 



\ 



I) 
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Now, we have already established that e tY is in O(n) for all t if and only if 
ytr — _y_ Thus we see that the Lie algebra of E(n) is the space of all (n+l) x (n+1) 
real matrices of the form (3.14) with Y satisfying Y tr = —Y. 

A similar argument shows that the Lie algebra of P(n; 1) is the space of all 
(n + 2) x (n + 2) real matrices of the form 

/ 



Y 



V o 



Vl \ 



with Y 6 so(n; 1) 



6. Properties of the Lie Algebra 



We will now establish various basic properties of the Lie algebra of a matrix 
Lie group. The reader is invited to verify by direct calculation that these general 
properties hold for the examples computed in the previous section. 

Proposition 3.14. Let G be a matrix Lie group, and X an element of its Lie 
ra. Then e x is an element of the identity component of G. 



Proof. By definition of the Lie algebra, e tx lies in G for all real t. But as t 
varies from to 1, e tx is a continuous path connecting the identity to e^. □ 



Proposition 3.15. Let G be a matrix Lie group, with Lie algebra q. Let X be 
an element of 2, and A an element of G. Then AX A -1 is in g. 



Proof. This is immediate, since by Proposition |3.3 

e t(AXA-*) =Ae t* A -\ 

and Ae tx A~ l e G. 



□ 



Theorem 3.16. Let G be a matrix Lie group, q its Lie algebra, and X,Y ele- 
ments of g. Then 

1. sX € g f or a M rea l numbers s, 

2. X + Y eg, 

3. XY — YX e g . 

If you are following the physics convention for the definition of the Lie algebra, 
then condition || should be replaced with the condition —i (XY — YX) € g. 

PROOF. Point |l| is immediate, since e*( sX ) = e^ ts ^ x , which must be in G if X is 
in q. Point ||is easy to verify if X and Y commute, since then e*( x+y ) = e tx e tY . If 
X and Y do not commute, this argument does not work. However, the Lie product 
formula says that 



e t(X+Y) = Um ( e tX/m e tY/m 




Because X and Y are in the Lie algebra, e tx ^ m and e tY l m are in G, as is ( e t x /m e tY/m\ 
since G is a group. But now because G is a matrix Lie group, the limit of things 
in G must be again in G, provided that the limit is invertible. Since e^ x+Y ^ is 
automatically invertible, we conclude that it must be in G. This shows that X + Y 
is in g. 



6. PROPERTIES OF THE LIE ALGEBRA 



41 



Now for point |^. Recall (Proposition 3.4) that j[t\ t _ Q e tX = X. It follows that 



sit— o etX Y = XY, and hence by the product rule (Exercise |l|) 

' fY Ye- tx ) = (XY)e° + (e°Y)(-X) 
= XY - YX. 



dt 



But now, by Proposition 3.15 , e tx Ye~ tx is in g for all t. Since we have (by points 
[j] and ||) established that g is a real vector space, it follows that the derivative of 
any smooth curve lying in g must be again in g. Thus XY — YX is in g. □ 

Definition 3.17. Given two n x n matrices A and B, the bracket (or com- 
mutator) of A and B is defined to be simply 

[A, B] = AB- BA. 



According to Theorem 3.16, the Lie algebra of any matrix Lie group is closed 
under brackets. 

The following very important theorem tells us that a Lie group homomorphism 
between two Lie groups gives rise in a natural way to a map between the corre- 
sponding Lie algebras. In particular, this will tell us that two isomorphic Lie groups 
have "the same" Lie algebras. (That is, the Lie algebras are isomorphic in the sense 
of Section g.) See Exercise ^|. 

Theorem 3.18. Let G and H be matrix Lie groups, with Lie algebras g and (), 
respectively. Suppose that <f> : G — > H be a Lie group homomorphism. Then there 
exists a unique real linear map (j) '■ 8 — ► f) such that 

0( e *) = e *W 

for all leg. The map <f> has following additional properties 

1. ^(AXA- 1 ) = 4>(A)^(X)4>(A)^ 1 , for all X € g, A € G. 

2. $QX,Y])= \4>{X),4>{Y)\, for all X,Y E g. 

3. f t \ t=0 ^e tx ),forallXeg. 

If G, H , and K are matrix Lie groups and <j> : H — > K and ip : G — > H are Lie 
group homomorphisms, then 

(j) o ip = (j) o ip. 

In practice, given a Lie group homomorphism (f>, the way one goes about com- 
puting 4> is by using Property ^. Of course, since <f> is (real) linear, it suffices to 
compute on a basis for g. In the language of differentiable manifolds, Property 
H says that (f> is the derivative (or differential) of <j) at the identity, which is the 
standard definition of <f>. (See also Exercise |l9| .) 

A linear map with property (||) is called a Lie algebra homomorphism. (See 
Section |[) This theorem says that every Lie group homomorphism gives rise to a 
Lie algebra homomorphism. We will see eventually that the converse is true under 
certain circumstances. Specifically, suppose that G and H are Lie groups, and 
4> : g — * fj is a Lie algebra homomorphism. If G is connected and simply connected, 
then there exists a unique Lie group homomorphism <f> : G — > H such that <p and <p 
are related as in Theorem [3.1g . 
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Proof. The proof is similar to the proof of Theorem 3.16. Since is a con- 
tinuous group homomorphism, (j>(e tx ) will be a one-parameter subgroup of H, for 
each leg. Thus by Theorem 3. 12[ , there is a unique Z such that 

(3.15) cj)(e tx )=e tz 

for all tel. This Z must lie in f) since e tz = <j) (e tx ) e H. 

We now define 4>{X) — Z, and check in several steps that (j> has the required 
properties. 

Step 1: (j){e x ) = e^ x \ 

This follows from (3.15) and our definition of <f>, by putting t = 1. 
Step 2: <£(sX) = s^(X) /or a// s € R. 

This is immediate, since if cf>(e tx ) — e tz , then cf>(e tsX ) — e tsZ . 

Step 3: 4>{X + Y) = 4>{X) + 4>{Y). 
By Steps 1 and 2, 

e t4>(X+Y) _ e 4,[t(X+Y)] _ f e t{X+Y)^ _ 

By the Lie product formula, and the fact that </> is a continuous homomorphism: 

= </>( lim (V*/«V y / m " 



lim 

m — *oo 



4> (e tx / m ) 0(e* F / m ) 



But then we have 

e mX+Y) = Hm / t4,(X)/m e t4>(Y)/m\ m = e «(0(X)+0(y)) _ 
m — >oo V / 

Differentiating this result at t = gives the desired result. 

Step 4: ^(AXA- 1 ) = <)){A)4>{X)4)(A)^ 1 . 
By Steps 1 and 2, 

expt^AXA^ 1 ) = exp^tAXA- 1 ) = <j> (exptAXA^ 1 ) . 
Using a property of the exponential and Step 1, this becomes 

expt^AXA- 1 ) = <f> (Ae tx A^ 1 ) = ^{A^e^^Ay 1 

= ( j){A)e t ^ xs >4>{A)- 1 . 
Differentiating this at t — gives the desired result. 

Step 5: 4>{[X,Y])= [^(X)^(Y)]. 

Recall from the proof of Theorem 3.16 that 

\X Y] = —I e tx Ye~ tx 
L ' J dt\t=0 

Hence 

H\X,y\) = m t \ t ^ x Ye-^) = i\ t= J(e tX Ye-^) 

where we have used the fact that a derivative commutes with a linear transforma- 
tion. 
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But then by Step 4, 

= iLo et * (x) ^ Y ) e ~ tkx) 

= 0(A), 0(F) 

Step 6: j(X)= f t \ t= J{e tx ). 
This follows from ( 3.1 5| ) and our definition of i 



Step 7: 4> is the unique real-linear map such that (j){e x ) = e^ x '. 
Suppose that ip is another such map. Then 

so that 

Thus by Step 6, ip coincides with <p. 

Step 8: (j) o i/j = (j) o ip. 
For any leg, 

Thus f^p(X) = 4>o 4>{X). □ 

Definition 3.19 (The Adjoint Mapping). Let G be a matrix Lie group, with 
Lie algebra g. Then for each A £ G, define a linear map AdA : g — > g &?/ i/ie 
/orma/a 

AdA(A) = AAA -1 . 
We will let Ad denote the map A — » AdA. 

Proposition 3.20. Let G be a matrix Lie group, with Lie algebra g. Then for 
each A € G, AdA is an invertible linear transformation of g with inverse AdA -1 , 
and Ad : G — > GL(g) is a group homomorphism. 



PROOF. Easy. Note that Proposition 3.15| guarantees that AdA(A) is actually 



in g for all leg. □ 

Since g is a real vector space with some dimension k, GL(g) is essentially the 
same as GL(fc;R). Thus we will regard GL(g) as a matrix Lie group. It is easy to 
show that Ad : G — > GL(g) is continuous, and so is a Lie group homomorphism. By 



Theorem 3.18, there is an associated real linear map Ad from the Lie algebra of G 



to the Lie algebra of GL(g), i.e., from g to gl(g), with the property that 

~ Kdx = Ad (e x ) . 



e 
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Proposition 3.21. Let G be a matrix Lie group, let g its Lie algebra, and let 
Ad : G — ► GL(g) be the Lie group homomorphism defined above. Let Ad : g — »g7(g) 
be the associated Lie algebra map. Then for all 1,7 eg 

AdX(Y) = [X,Y]. 



Proof. Recall that by Theorem |3.1q , Ad can be computed as follows: 

— I 

dt lt=0 ' 



AdA= f t \Ad(e tx ). 



Thus 

Ad*Cn = TtLo Acl(e tX )0n = i\ t=Q e tX Ye- tx 
= [X,Y] 

which is what we wanted to prove. See also Exercise |l3|. □ 

7. The Exponential Mapping 

Definition 3.22. If G is a matrix Lie group with Lie algebra g, then the ex- 
ponential mapping for G is the map 

exp : g — > G. 

In general the exponential mapping is neither one-to-one nor onto. Neverthe- 
less, it provides an crucial mechanism for passing information between the group 
and the Lie algebra. The following result says that the exponential mapping is 
locally one-to-one and onto, a result that will be essential later. 

Theorem 3.23. Let G be a matrix Lie group with Lie algebra g. Then there 
exist a neighborhood U of zero in g and a neighborhood V of I in G such that the 
exponential mapping takes U homeomorphically onto V . 

Proof. We follow the proof of Theorem 1.3.11 in Brocker and torn Dieck. In 
view of what we have proved about the matrix logarithm, we know this result for 
the case of GL(n;C). To prove the general case, we consider a matrix Lie group 
G < GL(n; C), with Lie algebra g. 

Lemma 3.24. Suppose g n are elements ofG, and that g n — > /. LetY n = \ogg n , 
which is defined for all sufficiently large n. Suppose Y n / \\Y n \\ — ► Y € gl(n;C). 
Then Y eg. 

Proof. To show that Y e g, we must show that exptY G G for all t <E R. As 
n — > oo, (t/ \\Y n \\) Y n — > tY. Note that since g n — > /, Y n — > 0, and so ||y„|| — > 0. 
Thus we can find integers m„ such that (m n H^nll) — > t. Then exp(m„F„) = 
exp [(m„ ||r„||) (Y n / \\Y n \\)} - exp {tY). But exp {m n Y n ) = exp (K„) m " = {g n ) m " e 
G, and G is closed, so exp (tY) EG. □ 

We think of gl (n;C) as C" 2 R 2 ™ 2 . Then g is a subspace of R 2 " 2 . Let D 
denote the orthogonal complement of g with respect to the usual inner product on 
R 2 ™ . Consider the map <I> : g © D — * GL(n; C) given by 

${X,Y) =e x e Y . 

Of course, we can identify g © D with R 2 " . Moreover, GL(n; C) is an open subset 
of gl (n; C) = R 2 ™ . Thus we can regard $ as a map from R 2n to itself. 
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Now, using the properties of the matrix exponential, we see that 

d 



dt 
d 
~dt 



$ (tx, 0) = X 
t=o 



$(o,tr) = y. 



=o 



This shows that the derivative of <I> at the point € R 2 ™ is the identity. (Recall 
that the derivative at a point of a function from R 2 ™ 2 to itself is a linear map of 
R 2 ™ to itself, in this case the identity map.) In particular, the derivative of $ at 
is invertiblc. Thus the inverse function theorem says that $ has a continuous local 
inverse, defined in a neighborhood of I. 

Now let U be any neighborhood of zero in g. I want to show that exp (U) 
contains a neighborhood of I in G. Suppose not. Then we can find a sequence 
g n G G with g n — > / such that no g n is in exp (U). Since <E> is locally invertiblc, 
we can write g n (for large n) uniquely as g n = exp (X n ) exp (Y n ), with X n £ g and 
Y n G D. Since g n — > / and $ _1 is continuous, X n and Y n tend to zero. Thus (for 
large n), X n £ U. So we must have (for large n) Y n ^ 0, otherwise g n would be in 
exp([/). 

Let g„ = exp(Y"„) = exp (— X n ) g n . Note that g n € G and <? n — * /. Since the 
unit ball in D is compact, we can choose a subsequence of {Y n } (still called {1^}) 
so that Y n / \\Y n \\ converges to some Y £ D, with ||y|| = 1. But then by the Lemma, 
7 eg! This is a contradiction, because D is the orthogonal complement of Q. 

So for every neighborhood U of zero in g, exp (U) contains a neighborhood of 
the identity in G. If we make U small enough, then the exponential will be one-to- 
one on U. (The existence of the matrix logarithm implies that the exponential is 
one-to-one near zero.) Let log denote the inverse map, defined on exp (J7). Since U 
is compact, and exp is one-to-one and continuous on U, log will be continuous. (This 
is a standard topological result.) So take V to be a neighborhood of / contained in 
exp (U), and let U' = exp 1 (V) n U. Then U' is open and the exponential takes 
U' homeomorphically onto V. □ 



Definition 3.25. IfU and V are as in Proposition 3.2c, then the inverse map 
exp -1 : V — > g is called the logarithm for G. 

COROLLARY 3.26. If G is a connected matrix Lie group, then every element A 
of G can be written in the form 

(3.16) A = e Xl e X2 ■ ■ ■ e x " 

for some X\, Xi, ■ ■ ■ X n in g. 

Proof. Recall that for us, saying G is connected means that G is path- 
connected. This certainly means that G is connected in the usual topological sense, 
namely, the only non-empty subset of G that is both open and closed is G itself. 



So let E denote the set of all A e G that can be written in the form (3.16). In light 
of the Proposition, E contains a neighborhood V of the identity. In particular, E 
is non-empty. 

We first claim that E is open. To see this, consider A € E. Then look at the 
set of matrices of the form AB, with B £ V. This will be a neighborhood of A. But 
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every such B can be written as B = e x and A can be written as A = e Xl e X2 ■ ■ ■ e Xn , 
so AB = e Xl e X2 ■ ■ ■ e x "e x . 

Now we claim that E is closed (in G). Suppose A E G, and there is a sequence 
A n G E with A n — ^ A. Then AA' 1 — > /. Thus we can choose some no such 
that AA~l G V. Then AA~^ = e x and A = A no e x . But by assumption, A no = 
e x i e x 2 . . . e x ", so A = e Xl e X2 ■ ■ ■ e Xn e x . Thus A E E, and E is closed. 

Thus E is both open and closed, so E = G. □ 



8. Lie Algebras 

Definition 3.27. A finite- dimensional real or complex Lie algebra is a 

finite- dimensional real or complex vector space g, together with a map [ ] from gxg 
into g, with the following properties: 

1. [ ] is bilinear. 

2. [X, Y] = — [Y, X] for all X, Y G 0. 

3. [X, [Y, Z]\ + [Y, [Z, X]] + [Z, [X, Y}] = for all X,Y,Zeg. 

Condition || is called the Jacobi identity. Note also that Condition || implies 
that [X, X] = for all X € 0. The same three conditions define a Lie algebra over 
an arbitrary field F, except that if F has characteristic two, then one should add the 
condition [X, X] = 0, which doesn't follow from skew-symmetry in characteristic 
two. We will deal only with finite-dimensional Lie algebras, and will from now on 
interpret "Lie algebra" as "finite-dimensional Lie algebra." 

A Lie algebra is in fact an algebra in the usual sense, but the product operation 
] for this algebra is neither commutative nor associative. The Jacobi identity 
should be thought of as a substitute for associativity. 

Proposition 3.28. The space gl(n;M) of all n x n real matrices is a real Lie 
algebra with respect to the bracket operation [A, B] = AB — BA. The space gl(n; C) 
of all n x n complex matrices is a complex Lie algebra with respect to the analogous 
bracket operation. 

Let V is a finite- dimensional real or complex vector space, and let g\{V) denote 
the space of linear maps ofV into itself. Then g\(V) becomes a real or complex Lie 
algebra with the bracket operation [A, B] = AB — BA. 

Proof. The only non-trivial point is the Jacobi identity. The only way to 
prove this is to write everything out and see, and this is best left to the reader. 
Note that each triple bracket generates four terms, for a total of twelve. Each of 
the six orderings of {X, Y, Z} occurs twice, once with a plus sign and once with a 
minus sign. □ 



Definition 3.29. A subalgebra of a real or complex Lie algebra g is a sub- 
space \) of Q such that [H±, H2] G f) for all Hi, H2 G f). If Q is a complex Lie algebra, 
and t) is a real subspace of g which is closed under brackets, then \) is said to be a 
real subalgebra of g. 

If g and f) are Lie algebras, then a linear map 4> '■ g — > t) is called a Lie algebra 
homomorphism if <fi([X, Y]) = \<j>(X), <j>(Y)] for all X, Y G g. If in addition <fr is 
one-to-one and onto, then <f> is called a Lie algebra isomorphism. A Lie algebra 
isomorphism of a Lie algebra with itself is called a Lie algebra automorphism. 
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A subalgebra of a Lie algebra is again a Lie algebra. A real subalgebra of a 
complex Lie algebra is a real Lie algebra. The inverse of a Lie algebra isomorphism 
is again a Lie algebra isomorphism. 

Proposition 3.30. The Lie algebra q of a matrix Lie group G is a real Lie 
ra. 



Proof. By Theorem 3.16 , g is a real subalgebra of gl(n; C) complex matrices, 
and is thus a real Lie algebra. □ 



Theorem 3.31 (Ado). Every finite-dimensional real Lie algebra is isomorphic 
to a subalgebra o/gl(n;R). Every finite-dimensional complex Lie algebra is isomor- 
phic to a (complex) subalgebra o/gl(n;C). 

This remarkable theorem is proved in Varadarajan. The proof is well beyond 
the scope of this course (which is after all a course on Lie groups), and requires 
a deep understanding of the structure of complex Lie algebras. The theorem tells 
us that every Lie algebra is (isomorphic to) a Lie algebra of matrices. (This is 
in contrast to the situation for Lie groups, where most but not all Lie groups are 
matrix Lie groups.) 

Definition 3.32. Let g be a Lie algebra. For X G g, define a linear map 
a,dX : g — > g by 

adX(Y) = [X,Y]. 

Thus "ad" (i.e., the map X — > &dX) can be viewed as a linear map from g into 
gl(fl), where gl(g) denotes the space of linear operators from g to g. 

Since &dX(Y) is just [X, Y], it might seem foolish to introduce the additional 
"ad" notation. However, thinking of [X, Y] as a linear map in Y for each fixed X, 
gives a somewhat different perspective. In any case, the "ad" notation is extremely 
useful in some situations. For example, instead of writing 

[X, [X, [X, [X,Y]}]] 

we can now write 

(zdXf(Y). 

This kind of notation will be essential in Section [y. 

Proposition 3.33. If g is a Lie algebra, then 

ad[A, Y] = adAadF - adYadX = [acLY, adY]. 
That is, ad: g — ► gl(g) is a Lie algebra homomorphism. 

PROOF. Observe that 

ad[X,Y](Z) = [[X,Y],Z] 

whereas 

[adX, adY] (if) = [X, [Y, Z]] - [Y, [X, Z\\. 

So we require that 

[[X,Y] ) Z] = [X, [Y,Z]] - [Y, [A, Z}} 



48 



3. LIE ALGEBRAS AND THE EXPONENTIAL MAPPING 



or equivalently 

0= [X, [Y,Z]} + [Y,[Z,X}] + [Z, [X,Y]} 
which is exactly the Jacobi identity. □ 

Recall that for any leg, and any 4e G, we define 

AdA(X) = AX A- 1 

and that Ad: G — » GL(g) is a Lie group homomorphism. We showed (Proposition 



3.21 ) that the associated Lie algebra homomorphism Ad : g — > gl(g) is given by 

AdX(Y) = [X,Y]. 
In our new notation, we may say 

Ad = ad 

By the defining property of Ad, we have the following identity: For all X G g, 
(3.17) Ad(e x )=e adX . 



Note that both sides of ( 3.17 ) are linear operators on the Lie algebra g. This is 
an important relation, which can also be verified directly, by expanding out both 
sides. (See Exercise |l^.) 

8.1. Structure Constants. Let g be a finite-dimensional real or complex Lie 
algebra, and let X\, • • • , X n be a basis for g (as a vector space). Then for each 
[Xi,Xj] can be written uniquely in the form 



[Xi,Xj] — cjjkXk. 



fc=l 

The constants cy*. are called the structure constants of g (with respect to the 
chosen basis). Clearly, the structure constants determine the bracket operation on 
g. In some of the literature, the structure constants play an important role, although 
we will not have occasion to use them in this course. (In the physics literature, the 
structure constants are defined as [Xi,Xj] — V—lJ2k c ijkXk, reflecting the factor 
of \J — 1 difference between the physics definition of the Lie algebra and our own.) 
The structure constants satisfy the following two conditions, 



Cijk ~\~ Cjik 

/ ^ n ■ ■ f j . — 1_ n ■ i n ■ j — I 



for all i,j, k, I. The first of these conditions comes from the skew-symmetry of the 
bracket, and the second comes from the Jacobi identity. (The reader is invited to 
verify these conditions for himself.) 

9. The Complexification of a Real Lie Algebra 

Definition 3.34. If V is a finite- dimensional real vector space, then the com- 
plexification ofV, denoted Vc, is the space of formal linear combinations 
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with v±,V2 S V. This becomes a real vector space in the obvious way, and becomes 
a complex vector space if we define 

i(vi + iv 2 ) = —v 2 + ivi . 

We could more pedantically define Vc to be the space of ordered pairs (vx,v 2 ), 
but this is notationally cumbersome. It is straightforward to verify that the above 
definition really makes Vc into a complex vector space. We will regard V as a real 
subspace of Vc in the obvious way. 

Proposition 3.35. Let g be a finite- dimensional real Lie algebra, and Qc its 
complexification (as a real vector space). Then the bracket operation on q has a 
unique extension to Qc which makes Qc into a complex Lie algebra. The complex 
Lie algebra gc is called the complexification of the real Lie algebra g. 

Proof. The uniqueness of the extension is obvious, since if the bracket oper- 
ation on gc is to be bilinear, then it must be given by 

(3.18) [X x + iX 2 , Y x + iY 2 \ = {[Xx, Yx] - [X 2 , Y 2 ]) + z {[X lt Y 2 ] + [X 2 ,Yx]) • 



To show existence, we must now check that ( 3.18 ) is really bilinea r an d skew- 
symmetric, and that it satisfies the Jacobi identity. It is clear t hat (p. 16 ) is real 
bilinear, and skew-symmetric. The skew-symmetry means that if ( 3.1 8| ) is complex 
linear in the first factor, it is also complex linear in the second factor. Thus we 
need only show that 

(3.19) [i(Xi + iX 2 ), Y x + iY 2 ] = i [X x + iX 2 , Yx + iY 2 ] . 



Well, the left side of ( f3T9| ) is 

[-X 2 + iXx,Yx + iY 2 \ = (- [X 2 , Yx] - [Xx,Y 2 ]) + 1 ([Xx,Yx] - [X 2 ,Y 2 ]) 



whereas the right side of ( p.l9[ ) is 

i {([Xx,Yx] - [X 2 ,Y 2 ]) + i ([X 2 , Yx] + [Xx,Y 2 ])} 
= (- [X 2 , Yx] - [Xx,Y 2 ]) + 1 ([Xx,Yx] - [X 2 ,Y 2 ]) , 

and indeed these are equal. 

It remains to check the Jacobi identity. Of course, the Jacobi identity holds if 
X, Y, and Z are in g. But now observe that the expression on the left side of the 
Jacobi identity is (complex!) linear in X for fixed Y and Z. It follows that the 
Jacobi identity holds if X is in gc, and Y, Z in g. The same argument then shows 
that we can extend to Y in gc, and then to Z in gc- Thus the Jacobi identity holds 
in qc- □ 



Proposition 3.36. The Lie algebras gl(n;C), sl(n;C) ; so(n;C) ; and sp(n;C) 
are complex Lie algebras, as is the Lie algebra of the complex Heisenberg group. In 
addition, we have the following isomorphisms of complex Lie algebras 

g\(n;R) c S g|(n;C) 

u(n)c = gl(«;C) 

sl(n;R) c = sl(n;C) 

so(n)c — so(n;C) 

sp(n;R)c — sp(n;C) 

sp(n) c = sp(n;C). 



50 



3. LIE ALGEBRAS AND THE EXPONENTIAL MAPPING 



Proof. From the computations in the previous section we see easily that the 
specified Lie algebras are in fact complex subalgebras of gl(n;C), and hence are 
complex Lie algebras. 

Now, gl(n; C) is the space of all n x n complex matrices, whereas gl (n; R) is the 
space of all n x n real matrices. Clearly, then, every X e gl (n; C) can be written 
uniquely in the form X\ + 1X2, with X\,X 2 S gl(n;R). This gives us a complex 
vector space isomorphism of gl (n;R) c with gl(n;C), and it is a triviality to check 
that this is a Lie algebra isomorphism. 

On the other hand, u(n) is the space of all n x n complex skew-self-adjoint 
matrices. But if X is any n x n complex matrix, then 

X-X* X + X* 

X — -\- 

2 2 

X-X* (-iX)-(-iX)* 

= - - — . 

2 2 

Thus X can be written as a skew matrix plus i times a skew matrix, and it is easy 
to see that this decomposition is unique. Thus every X in gl(n; C) can be written 
uniquely as X\ + iX 2 , with X\ and X 2 in u(n). It follows that u(n)c — gl(n; C). 

The verification of the remaining isomorphisms is similar, and is left as an 
exercise to the reader. □ 

Note that u(n)c = gl (n; R) c = gl(n;C). However, u(n) is not isomorphic to 
gl(n;R), except when n = 1. The real Lie algebras u(n) and gl(n;R) are called 
real forms of the complex Lie algebra gl(n; C). A given complex Lie algebra may 
have several non-isomorphic real forms. See Exercise 

Physicists do not always clearly distinguish between a matrix Lie group and 
its (real) Lie algebra, or between a real Lie algebra and its complexification. Thus, 
for example, some references in the physics literature to SU(2) actually refer to the 
complexified Lie algebra, sl(2;C). 



10. Exercises 

The product rule. Recall that a matrix- valued function A(t) is smooth if 
each Aij (t) is smooth. The derivative of such a function is defined as 

dA\ dA i3 



or equivalcntly, 



dt / ■„■ dt 



±A(t) = lim ^ + 
dt h^o h 



Let A{t) and B(t) be two such functions. Prove that A(t)B(t) is again 
smooth, and that 

d T , * . . , v n dA . . , , . d!3 

^[A(t)B{t)] = -B(t)+A(t)-. 

Using the Jordan canonical form, show that every n x n matrix A can be 
written as A = S + N, with S diagonalizable (over C), N nilpotent, and 
SN = NS. Recall that the Jordan canonical form is block diagonal, with 
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each block of the form 



\ A 

3. Let X and Y be n x n matrices. Show that there exists a constant C such 
that 

C 



e (X+Y)/ m _ e X/m e Y/m 



< 



for all integers m > 1. 

4. Using the Jordan canonical form, show that every n x n complex matrix A 
is the limit of a sequence of diagonalizable matrices. 

Hint: If the characteristic polynomial of A has n distinct roots, then A 
is diagonalizable. 

5. Give an example of a matrix Lie group G and a matrix X such that e x £ G, 
but X £ g. 

6. Show that two isomorphic matrix Lie groups have isomorphic Lie algebras. 

7. TTie Lie algebra so(3; 1). Write out explicitly the general form of a 4 x 4 real 
matrix in so(3; 1). 



Verify directly that Proposition 3.15 and Theorem 3.1£ hold for the Lie 
algebra of SU(n). 

The Lie algebra su(2). Show that the following matrices form a basis for the 
real Lie algebra su(2): 

if i \ _ 1 ( 1 \ _ x ( i 

o -i E2 * { - 1 o j ^ 3 - 2 i o 



Compute [E\,E2\ : [^2,^3], and [E^,E{\. Show that there is an invert- 
ible linear map (f> : su(2) -> R 3 such that <p([X,Y]) = x 0(F) for all 

IjFd su(2), where x denotes the cross-product on R 3 . 

10. The Lie algebras su(2) and so(3). Show that the real Lie algebras su(2) and 
so(3) are isomorphic. 

Note: Nevertheless, the corresponding groups SU(2) and SO(3) are not 
isomorphic. (Although SO(3) is isomorphic to SU(2)/ {/, — I}.) 

11. The Lie algebras su(2) and sl(2;R). Show that su(2) and sl(2;R) are not 
isomorphic Lie algebras, even though su(2)c = sl(2;R)c- 

Hint: Using Exercise |[ show that su(2) has no two-dimensional subal- 
gebras. 

12. Let G be a matrix Lie group, and g its Lie algebra. For each A £ G, show 
that AdA is a Lie algebra automorphism of g. 

13. Ad and ad. Let X and Y be matrices. Show by induction that 

{ a AX) n {Y) = j^( n \x k Y{-XT- k . 

k=0 ^ ' 

Now show by direct computation that 

e adx (F) = Ad{e x )Y = e x Y e - x . 
You may assume that it is legal to multiply power series term-by-term. (This 



result was obtained indirectly in Equation 3.17.) 
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Hint: Recall that Pascal's Triangle gives a relationship between things 
of the form ("J 1 ) and things of the form (^). 

14. The complexification of a real Lie algebra. Let g be a real Lie algebra, gc its 
complexihcation, and f) an arbitrary complex Lie algebra. Show that every 
real Lie algebra homomorphism of g into f) extends uniquely to a complex 
Lie algebra homomorphism of gc into t). (This is the universal property 
of the complexification of a real Lie algebra. This property can be used as 
an alternative definition of the complexification.) 

f 5. The exponential mapping for SL (2; R). Show that the image of the exponen- 
tial mapping for SL(2;R) consists of precisely those matrices A G SL(2;K) 
such that trace (A) > —2, together with the matrix —I (which has trace —2). 
You will need to consider the possibilities for the eigenvalues of a matrix in 
the Lie algebra si (2; R) and in the group SL (2; M). In the Lie algebra, show 
that the eigenvalues are of the form (A, —A) or (iX, —iX) with A real. In the 
group, show that the eigenvalues are of the form (a, 1/a) or (—a, — 1/a) with 
a real and positive, or else of the form (e ie , e~ lB ) , with 6 real. The case of 
a repeated eigenvalue ((0, 0) in the Lie algebra and (1, 1) or (—1, —1) in the 
group) will have to be treated separately. 

Show that the image of the exponential mapping is not dense in SL (2; R). 

16. Using Exercise |], show that the exponential mapping for GL(n; C) maps onto 
a dense subset of GL(n; C). 

17. The exponential mapping for the Heisenberg group. Show that the exponen- 
tial mapping from the Lie algebra of the Heisenberg group to the Heisenberg 
group is one-to-one and onto. 

18. The exponential mapping for U(n). Show that the exponential mapping 
from u(n) to U(n) is onto, but not one-to-one. (Note that this shows that 
U(n) is connected.) 

Hint: Every unitary matrix has an orthonormal basis of eigenvectors. 

19. Let G be a matrix Lie group, and g its Lie algebra. Let A(t) be a smooth 
curve lying in G, with A{Q) — I. Let X = j^\ t=0 A(t). Show that leg. 

Hint: Use Proposition |Q| . 

Note: This shows that the Lie algebra g coincides with what would be 
called the tangent space at the identity in the language of differentiable 
manifolds. 

20. Consider the space gl(n; C) of all n x n complex matrices. As usual, for X E 
gl(n;C), define adX : gl(n;C) -► gl(n;C) by &dX(Y) = [X,Y]. Suppose 
that X is a diagonalizable matrix. Show, then, that a,dX is diagonalizable 
as an operator on gl(n; C). 

Hint: Consider first the case where X is actually diagonal. 
Note: The problem of diagonalizing dAX is an important one that we 
will encounter again in Chapter 6, when we consider semisimple Lie algebras. 



CHAPTER 4 



The Baker- Campbell-Hausdorff Formula 



1. The Baker-Campbell-Hausdorff Formula for the Heisenberg Group 

A crucial result of Chapter 5 will be the following: Let G and H be matrix 
Lie groups, with Lie algebras q and f), and suppose that G is connected and simply 
connected. Then if <j> : q — > fj is a Lie algebra homomorphism, there exists a unique 
Lie group homomorphism <f> : G — > H such that <j> and <fi are related as in Theorem 



3.18. This result is extremely important because it implies that if G is connected 
and simply connected, then there is a natural one-to-one correspondence between 
the representations of G and the representations of its Lie algebra q (as explained 
in Chapter 5). In practice, it is much easier to determine the representations of 
the Lie algebra than to determine directly the representations of the corresponding 
group. 

This result (relating Lie algebra homomorphisms and Lie group homomor- 
phisms) is deep. The "modern" proof (e.g., Varadarajan, Theorem 2.7.5) makes 
use of the Frobenius theorem, which is both hard to understand and hard to 
prove (Varadarajan, Section 1.3). Our proof will instead use the Baker-Campbell- 
Hausdorff formula, which is more easily stated and more easily motivated than the 
Frobenius theorem, but still deep. 

The idea is the following. The desired group homomorphism <j> : G — > H must 
satisfy 



(4.1) 0( 



e x ) = 



We would like, then, to define <p by this relation. This approach has two serious 
difficulties. First, a given element of G may not be expressible as e x , and even if 



it is, the A may not be unique. Second, it is very far from clear why the 4> in (4.1) 
(even to the extent it is well-defined) should be a group homomorphism. 

It is the second issue which the Baker-Campbell-Hausdorff formula addresses. 
(The first issue will be addressed in the next chapter; it is there that the simple con- 
nectedness of G comes into play.) Specifically, (one form of) the Baker-Campbell- 
Hausdorff formula says that if A and Y are sufficiently small, then 

(4.2) log(e x e Y ) = X + Y + \ [X, Y] + i [X, [A, Y}} - ± [Y, [A, Y]]-\ . 

It is not supposed to be evident at the moment what "• • • " refers to. The only 



important point is that all of the terms in (4.2) are given in terms of A and Y, 



brackets of A and Y, brackets of brackets involving A and Y, etc. Then because 
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is a Lie algebra homomorphism, 

(log (e x e Y )) = 0(Jf) + 4>{Y) + §[0(X), 0(F)] 



(4.3) 



+ ±[4>(X), $(X), 0(F)]] - ±[4>{Y), [4>(X), cf>(Y)]\ 
= log (e 



e 4>(x) e <j>(Y) 



The relation (4.2) is extremely significant. For of course 



and so by (4.1), 



(e X e Y ) 



~ x e Y ) 



log e* 



= e HX) e HY) = 0(e x )0( e ^). 



Thus ( [4. 3| ) tells us that 

Thus, the Baker-Campbell-Hausdorff formula shows that on elements of the form 
e x , with A" small, is a group homomorphism. (See Corollary i.4 below.) 

The Baker-Campbell-Hausdorff formula shows that all the information about 
the group product, at least near the identity, is "encoded" in the Lie algebra. Thus 
if <fi is a Lie algebra homomorphism (which by defin ition preserves the Lie algebra 
structure), and if we define <fi near the identity by (4.1), then we can expect (j> to 
preserve the group structure, i.e., to be a group homomorphism. 

In this section we will look at how all of this works out in the very special case 
of the Heisenberg group. In the next section we will consider the general situation. 

Theorem 4.1. Suppose X and Y are nx n complex matrices, and that X and 
Y commute with their commutator. That is, suppose that 



Then 



[X,[X,Y}] = [Y,[X,Y]}=0. 



e x e Y = e x + Y +k[ x > Y \. 



This is the special case of (4.2) in which the series terminates after the [X, Y] 
term. 

Proof. Let X and Y be as in the statement of the theorem. We will prove 
that in fact 

4-2 



e tx e tY 



r 

exp [tX + tY + — [X, Y] 



which reduces to the desired result in the case t = 1. Since by assumption [A", Y] 
commutes with everything in sight, the above relation is equivalent to 



(4.4) 



Let us call the left side of ( |4.4| ) A(t) and the right side B{t). Our strategy 
will be to show that A (t) and B (t) satisfy the same differential equation, with the 
same initial conditions. We can see right away that 



y) 
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On the other hand, differentiating A (t) by means of the product rule gives 
(4.5) 

— - e tx Xe tY e-^ x ^ + e tx e tY Ye~^ x ^ + e tx e tY e~^ x ^ (-t [X, Y}) . 
at 

(You can verify that the last term on the right is correct by differentiating term- 
by-term.) 

Now, since X and Y commute with [X, Y], they also commute with e~^ x ' Y ^. 
Thus the second term on the right in (^^) can be rewritten as 

e tx e tY e~^ x - Y W. 



The first term on the right in (4.5) is more complicated, since X does not necessarily 
commute with e tY . However, 

Xe tY = e tY e- tY Xe tY 
= e tY Ad(e~ tY ) (X) 
= e«V tady (X) . 
But since [Y, [Y, X]] = - [Y, [X, Y]] = 0, 

e -tady (x) = X - t[Y, X] = X + 1 [X, Y] 
with all higher terms being zero. Using the fact that everything commutes with 
e -V[ x ' y l gives 

e tx Xe tY e-^ x ^ =e tx e tY e-^ x - Y ^ (X + t[X,Y\) 



Making these substitutions into (4.5) gives 

^ = e tx e tY e~^ x ^ (X + t [X, Y]) + e tx e tY e~^ x ^Y + e tx e tY e~^ x ^ (-t [X, Y]) 

= e tx e tY e-^ x ^ (X + Y) 
= A{t) (X + Y). 

Thus A (t) and B (t) satisfy the same differential equation. Moreover, A (0) = 
B (0) = /. Thus by standard uniqueness results for ordinary differential equations, 
A(t) = B (t) for all t. □ 



Theorem 4.2. Let H denote the Heisenberg group, and t) its Lie algebra. Let 
G be a matrix Lie group with Lie algebra g, and let <fr : f) — * g be a Lie algebra 
homomorphism. Then there exists a unique Lie group homomorphism <fi : H — > G 
such that 

<f>(e x )=e*W 

for all let). 

Proof. Recall that the Heisenberg group has the very special property that 
its exponential mapping is one-to-one and onto. Let "log" denote the inverse of 
this map. Define <fi '■ H — > G by the formula 

4>{A) = e ^ A) . 
We will show that (f> is a Lie group homomorphism. 
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If X and Y are in the Lie algebra of the Heisenberg group (3x3 strictly 
upper-triangular matrices), then [X, Y] is of the form 




such a matrix commutes with both X and Y. That is, X and Y commute with 
their commutator. Since is a Lie algebra homomorphism, <f> (X) and <p (Y) will 
also commute with their commutator: 



~<i>(x), 


'<f> (X) , 


4>{Y) 


' = <f>([X,[X,Y}}) = 




4>{X), 


HY) 


= 0([Y,[X,Y]})=O. 



We want to show that is a homomorphism, i.e., that 4> (AB) = cj> (A) cf> (B). 
Well, A can be written as e x for a unique I € t) and B can be written as e Y for a 
unique Yet). Thus by Theorem [Ol 



.(AB) 



(e x e Y ) 



Using the definition of <j) and the fact that is a Lie algebra homomorphism: 

(AB) = exp U (X) + (Y) + 1 p (X) , (y) 

Finally, using Theorem |4.l| again we have 

(AB) = e^'e^ = (A) ^ (B) . 

Thus is a group homomorphism. It is easy to check that <f> is continuous (by 
checking that log, exp, and <f> are all continuous), and so <f> is a Lie group homo- 
morphism. Moreover, (f> by definition has the right relationship to 4>. Furthermore, 
since the exponential mapping is one-to-one and onto, there can be at most one <p 
with <f> (e x ) — e^ x K So we have uniqueness. □ 

2. The General Baker-Campbell-Hausdorff Formula 

The importance of the Baker-Campbell-Hausdorff formula lies not in the details 
of the formula, but in the fact that there is one, and the fact that it gives log(e x e Y ) 
in terms of brackets of X and Y, brackets of brackets, etc. This tells us something 
very important, namely that (at least for elements of the form e x , X small) the 
group product for a matrix Lie group G is completely expressible in terms of the Lie 
algebra. (This is because l og ( e x e Y ) , and hence also e x e Y itself, can be computed 
in Lie-algebraic terms by (ff.2|).) 

We will actually state and prove an integral form of the Baker-Campbell- 
Hausdorff formula, rather than the series form (4.2). However, the integral form is 
sufficient to obtain the desired result (4.3). (See Corollary 4.4.) The series form of 
the Baker-Campbell-Hausdorff formula is stated precisely and proved in Varadara- 
jan, Sec. 2.15. 

Consider the function 

logz 



SO) = 



1 
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This function is defined and analytic in the disk {\z — 1| < 1}, and thus for z in 
this set, g(z) can be expressed as 

oo 
m=0 

This series has radius of convergence one. 

Now suppose V is a finite-dimensional complex vector space. Choose an arbi- 
trary basis for V, so that V can be identified with C™ and thus the norm of a linear 
operator on V can be defined. Then for any operator A on V with \\A — I\\ < 1, 
we can define 

oo 

g(A) = 5> m (A-ir. 

m— 

We are now ready to state the integral form of the Baker-Campbell-Hausdorff for- 
mula. 

Theorem 4.3 (Baker-Campbell-Hausdorff). For allnxn complex matrices X 
and Y with \\X\\ and \\Y\\ sufficiently small, 

(4.6) \og(e x e Y ) = X + [ g(e adx e tadY )(Y) dt. 

Jo 

Corollary 4.4. Let G be a matrix Lie group and g its Lie algebra. Suppose 
that 4> : g — > gl(n;C) is a Lie algebra homomorphism. Then for all sufficiently 
small X,Y in g, log (e x e Y ^f is in g, and 

(4.7) 4> [log (e x e Y )} = log (e^e^) . 

Note that e adx e* ady , and hence also g(e adx e tadY ), is a linear operator on the 



space gl(n; C) of allnxn complex matrices. In (4.6), this operator is being applied to 
the matrix Y. The fact that X and Y are assumed small guarantees that e adx e tadY 
is close to the identity operator on gl(n;C) for all < t < 1. This ensures that 
g(e adx e tadY ) is well defined. 

If X and Y commute, then we expect to have log (e x e y ) = log(e x+Y ) = X+Y. 
Exercise || asks you to verify that the Baker-Campbell-Hausdorff formula indeed 
gives X + Y in that case. 



Formula (4.6) is admittedly horrible-looking. However, we are interested not in 
the details of the formula, but in the fact that it expresses log (e x e Y ) (and hence 
e x e Y ) in terms of the Lie-algebraic quantities adX and adY. 

The goal of the Baker-Campbell-Hausdorff theorem is to compute log (e x e Y ). 
You may well ask, "Why don't we simply expand both exponentials and the loga- 
rithm in power series and multiply everything out?" Well, you can do this, and if 
you do it for the first several terms you will get the same answer as B-C-H. However, 
there is a serious problem with this approach, namely: How do you know that the 
terms in such an expansion are expressible in terms of commutators? Consider for 
example the quadratic term. It is clear that this will be a linear combination of X 2 , 
Y 2 , XY, and YX, But to be expressible in terms of commutators it must actually 
be a constant times (XY — YX). Of course, for the quadratic term you can just 
multiply it out and see, and indeed you get | (XY — YX) = | [X, Y}. But it is far 
from clear how to prove that a similar result occurs for all the higher terms. See 
Exercise [|. 
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Proof. We begin by proving that the corollary follows from the integral form 
of the Baker-Campbell-Hausdorff formula. The proof is conceptually similar to the 
reasoning in Equation ( |4.3| ). Note that if X and Y lie in some Lie algebra g then 
adX and adY" will preserve g, and so also will g(e &AX e ta,dY )(Y). Thus whenever 



formula (1.6) holds, log (e x e Y ) will lie in g. It remains only to verify (4.7). The 
idea is that if (f> is Lie algebra homomorphism, then it will take a big horrible looking 
expression involving 'ad' and X and Y, and turn it into the same expression with 
X and Y replaced by <fi (X) and <fi {Y). 

More precisely, since <p is a Lie algebra homomorphism, 

4>[Y,X] = $(Y),4>{X)] 

or 

4> (ady {X)) = ad^(r) (j>(X)j . 

More generally, 

0((adF)"(X)) = (ad0(y))"(0(X) 
This being the case, 

0(e ady (X)) = £^(( a dYf(X)) 



m! 
m— 



00 j-m , 

V — ( 

^ to! V 



Similarly, 



Jad0(r)) (0(X)) 

m=U 

= e tat * y > (4>{X)) . 
\e adx e tadY {X)) = e ^(A) e tad0(y) fr^X 



Assume now that X and Y are small enough that B-C-H applies to X and Y, and 
to <$>(X) and 0(V). Then, using the linearity of the integral and reasoning similar 
to the above, we have: 



l 

' dt 



(log (e x e Y )) = ftX) + I a ^ [(^ adX e tady - /)" (X) 

m=0 

™1 



J° m=0 V ' 



= log ( e ?W e ito) . 

This is what we wanted to show. □ 

Before coming to the proof Baker-Campbell-Hausdorff formula itself, we will 
obtain a result concerning derivatives of the exponential mapping. This result is 
valuable in its own right, and will play a central role in our proof of the Baker- 
Campbell-Hausdorff formula. 

Observe that if X and Y commute, then 

X+tY X tY 

e = e e 
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.1!) 



and so 



^X+tY 



„X d 



-,tY 



dt\t=0 dt\t=0 

In general, X and Y do not commute, and 



3 X Y. 



^X + tY 



+ e x Y 



dt\t=0 

This, as it turns out, is an important point. In particular, note that in the language 
of multivariate calculus 



(4.8) 



— I 

dt 1 4=0 ' 



X+tY _ 



directional derivative of exp at X, 
in the direction of Y 



Thus computing the left side of (4.8) is the same as computing all of the directional 
derivatives of the (matrix- valued) function exp. We expect the directional derivative 
to be a linear function of Y, for each fixed X. 
Now, the function 



1 



!-(!-« + |r- 



z z 

is an entire analytic function of z, even at z = 0, and is given by the power series 

z 00 

- = E(- 1 )"" : 



i 



z n 1 z z 2 

= 1 -2! + 3! 
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This scries (which has infinite radius of convergence), make sense when z is replaced 
by a linear operator A on some finite-dimensional vector space. 

Theorem 4.5 (Derivative of Exponential). Let X and Y be nxn complex ma- 
trices. Then 



„X + tY 



X 



^ = eX { Y - 21 • 3! 

More generally, if X (t) is a smooth matrix-valued function, then 



-&dX 



-on 



&dX 

[X,Y] , [X,[X,Y\] 



(4.10) 



di 



e X(t) = e x(o) 



I 



-adJC(O) 



adX(O) 



( 



dX I > 
dt I (=07 



Note that the d irec tional derivative in (4.£) i s inde ed linear in Y for each fixed 
X. Note also that (4.9) is just a special case of (4.10), by taking X(t) = X + tY, 
and evaluating at t = Q. 

Furthermore, observe that if X and Y commute, then only the first term in the 
series (4.9) survives. In that case, we obtain ^\ t ^ eX+tY = e x Y as expected. 

PROOF. It is possible to prove this Theorem by expanding everything in a 
power series and diffe rent iating term-by-term; we will not take th at ap proach. We 
will prove only form (4.£) of the derivative formula, but the form fl4.10| ) follows by 
the chain rule. 

Let us use the Lie product formula, and let us assume for the moment that it 
is legal to interchange limit and derivative. (We will consider this issue at the end.) 
Then we have 



-x 



dt 



„X+tY 



-X 



t=0 



lim — 

n^oo dt 



7=0 
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We now apply the product rule (generalized to n factors) to obtain 

n-l 



-X 



dt 



t=o 



JC+tY = £ -X 



lkj] (e x / n e tY ' n ) n k 1 (e x ' n e tY ' n Y/ 

71 — 1 _ t _ 1 ' 

Hm We*/") ' (e x ' n Y/ 



e X/n e tY/n 



k=0 
n-l 



But 



= lim iV (e*M yfe^M 

n— >oo n ' — ' V / \ / 

fc=0 

(e^)"*y( e ^/") fc 



Adfe"*/" 



(^) 



( e - adX /™) (y) 



(where we have used the relationship between and ad). So we have 

n-l 



(4.11) 



-x 



dt 



= lim 



t=o 



-adX/ n 



Observe now that X)fe=o (e _ad *' n ) is a geometric series. Let us now reason 
for a moment at the purely formal level. Using the usual formula for geometric 
series, we get 

( e -adX/n\ n 



-X 



dt 



^X+tY _ 



= lim — 



II 



t=o 



J g— adX/ r. 



lim 



-adX 



= lim 



adX , (adX) 2 
n i S^2T 



-adX 



adA - 

n2! 



I-e 



-adX 



-(^) 

-on- 



adA 

This is what we wanted to show! 

Does this argument make sense at any rigorous level? In fact it does. As 
usual, let us consider first the diagonalizable case. That is, assume that &dX is 
diagonalizable as an operator on gl(n; C), and assume that Y is an eigenvector for 
adA. This means that adA(Y") = [X, Y] = XY, for some A 6 C. Now, there are 
two cases, A = and A ^ 0. The A = case corresponds to the case in which X 
and Y commute, and we have already observed that the Theorem holds trivially in 
that case. 

The interesting case, then, is the case A ^ 0. Note that (adA)™ (Y) — X n Y, 
and so 



L-'AxiA (y) 



(Y). 



Thus the geometric series in (4.11) becomes an ordinary complex- valued series, with 
ratio e~ A / n . Since A ^ 0, this ratio will be different from one for all sufficiently 
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large n. Thus we get 



dt 



1 I - ie- x ' r ' 



e x+tY = km ^ ' I Y. 

t _ U^kj n I — e~ x l n J 



There is now no trouble in taking the limit as we did formally above to get 

-x ± 
dt 



e X+tY _ 1 6 X y 
t=0 A 



-adX 

<Y). 



adX 

We see then that the Theorem holds in the case that a.dX is diagonalizable and 
Y is an eigenvector of a,dX. If adX is diagonalizable but Y is not an eigenvector, 
then Y is a linear combination of eigenvectors and applying the above computation 
to each of those eigenvectors gives the desired result. 

We need, then, to consider the case where a.dX is not diagonalizable. But 
(Exercise if X is a diagonalizable matrix, then adX will be diagonalizable 
as an operator on gl(n;C). Since, as we have already observed, every matrix is 
the limit of diagonalizable matrices, we are essentially done. For it is easy to see 
by differentiating the power series term-by-term that e~ x ji\ t _ e x+tY exists and 
varies continuously with X. Thus once we have the Theorem for all diagonalizable 
X we have it for all X by passing to the limit. 

The only unresolved issue, then, is the interchange of limit and derivative which 
we performed at the very beginning of the argument. I do not want to spell this 
out in detail, but let us see what would be involved in justifying this. A standard 
theorem in elementary analysis says that if f n {t) — > f(t) pointwise, and in addition 
dfn/dt converges uniformly to some function g(t), then f(t) is diffcrentiable and 
df /dt = g(t). (E.g., Theorem 7.17 in W. Rudin's Principles of Mathematical Anal- 
ysis.) The key requirement is that the derivatives converge uniformly Uniform 
convergence of the /„'s themselves is definitely not sufficient. 

In our case, /«(£) = e~ x (e x /"e ty / n ) . The Lie product formula says that this 
converges pointwise to e~ x e x+tY . We need, then, to show that 

-e- x (e x / n e tY / n Y 
dt V / 

converges uniformly to some g(t), say on the interval — 1 < t < 1. This computation 
is similar to what we did above, with relatively minor modifications to account for 
the fact that we do not take t = and to make sure the convergence is uniform. 
This part of the proof is left as an exercise to the reader. □ 

2.1. Proof of the Baker-Campbell-Hausdorff Formula. We now turn to 
the proof of the Baker-Campbell-Hausdorff formula itself. Our argument follows 
Miller, Sec. 5.1, with minor differences of convention. (Warning: Miller's "Ad" is 
what we call "ad.") Define 

Z(t)= log (e x e tY ) 

If X and Y are sufficiently small, then Z (t) is defined for < t < 1. It is left as an 
exercise to verify that Z{t) is smooth. Our goal is to compute Z{1). 
By definition 

e Z(t) = £ X e tY 



1.2 



4. THE BAKER-CAMPBELL-HAUSDORFF FORMULA 



so that 

d 7,1 f\ I V t v\ — 1 



dt v ; 



On the other hand, by Theorem 



,- Z (t)d Z (t) (I-e-^ z W] (dZ\ 



dt 6 { adZ(t) \\dt ) 

Hence 

j _ e -adZ(t) -J _ 



adZ{t) J \ dt J 

If X and Y are small enough, then Z(t) will also be small, so that (i - e _adir W) /&dZ(t) 
will be close to the identity and thus invertible. So 

dZ (I- e-^W * 1 



^ *={ adZ(t) | (n 

Recall that e Z( -*) = e x e tY . Applying the homomorphism 'Ad' gives 
Ad ( e z W) = Ad (e x ) Ad (e* r ) . 



By the relationship ( p.l7|) between 'Ad' and 'ad,' this becomes 



e adZ(t) _ gadXgtadY 



or 

adX tadF\ 



adZ(t) = log (e adx e tady ) . 



Plugging this into ( 4.12 ) gives 

gj t-r,"T' r~ 

1 j \ log(e adX e tad ^) J 1 j ' 

But now observe that 

' l-z- 



log 2 



so, formally, ( 4.13 ) is the same as 

(4.14) ^=g(e adX e tadY )(Y). 

Reasoning as in the proof of Theorem |4.5| shows easily that this formal argument 
is actually correct. 

Now we are essentially done, for if we note that Z(Q) = X and integrate (4.14), 
we get 

Z(l) = X + f g(e &dx e tadY )(Y) dt 
Jo 

which is the Baker-Campbell-Hausdorff formula. 
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3. The Series Form of the Baker-Campbell-Hausdorff Formula 



Let us see how to get the first few terms of the series form of B-C-H from the 
integral form. Recall the function 

zlogz 



9{z) 



1 



[! + (*-!)] 



1 - 



(z-1) 

z-i {z-iy 



Multiplying this out and combining terms gives 

g{z) = l+ l -{z-l)-\(z-lf + 



The closed-form expression for g is 



5W = 1 + E 



(-1) 



n+l 



^-f n(n + l) 

n=l v ' 



Meanwhile 



e adX et adY_ /= I + &dX+ + . 



7 + tadY + 



t 2 (ady) 2 
2 

\2 



+ • 



- 7 



1V (adXr t 2 (adYY 
= adX + tadY + tadXadY + v ' H v - ; H . 

The crucial observation here is that e adx e tadY — J has no zero-order term, just 
first-order and higher in adX/adY. Thus (e adx e tadY — 7) will contribute only 
terms of degree n or higher in adX/adF. 

We have, then, up to degree two in adX/adF 



adA: tadY 



)=! + 



adX + tadY + ta.dXa.dY + 



(adX) 2 t 2 (adYY 



- - [adX + tadY + ---f 

T 1 , v * * , v „, (adX) 2 t 2 (adF) 2 

= 7 + -adX + -ady + -adXadF + ^ '— -\ i '- 

2 2 2 4 4 



- - {adXy + (adF) z + tadXadY + tadYadX 
6 L 

+ higher-order terms. 

We now to apply g ( e adx e * ady ) to Y and integrate. So (neglecting higher-order 
terms) by B-C-H, and noting that any term with adY acting first is zero: 

log(e x e Y ) 



X + 



Y + \ [X, Y] + \ [X, [X, Y}} I [X, [X, Y}} I % [X, Y]] 



dt 



= X + Y+±[X,Y]+(±-£) [X, [X, Y}} -\J Q tdt [Y, [X, Y}} 
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Thus if we do the algebra we end up with 

X + Y+±[X,Y]-\ 
higher order terms 



log (e x e Y ) = X + Y+ l -[X 1 Y] + ^[X, [X, Y]] 1 [Y, [X, Y}\ 



This is the expression in (4.2). 

4. Subgroups and Subalgebras 

Suppose that G is a matrix Lie group, H another matrix Lie group, and suppose 
that H C G. Then certainly the Lie algebra f) of H will be a subalgebra of the Lie 
algebra g of G. Does this go the other way around? That is given a Lie group G 
with Lie algebra g, and a subalgebra f) of g, is there a matrix Lie group H whose 
Lie algebra is f)? 

In the case of the Heisenberg group, the answer is yes. This is easily seen using 
the fact that the exponential mapping is one-to-one and onto, together with the 
special form of the Baker-Campbell-Hausdorff formula. (See Exercise [|) 

Unfortunately, the answer in general is no. For example, let G = GL (2; C) and 

let 



it 
ita 



t £ 



where a is irrational. If there is going to be a matrix Lie group H with Lie algebra 
f), then H would contain the set 



Hn = 



e lt 
e ita 



t e 



To be a matrix Lie group, H would have to be closed in GL (2; C), and so it would 
contain the closure of Hq, which (see ) is the set 

e lt 
e ls 



s,t £ 



But then the Lie algebra of H would have to contain the Lie algebra of Hi , which 
is two-dimensional! 

Fortunately, all is not lost. We can still get a subgroup H for each subalgebra 
f), if we weaken the condition that H be a matrix Lie group. In the above example, 
the subgroup we want is Hq, despite the fact that Hq is not a matrix Lie group. 

Definition 4.6. If H is any subgroup o/GL(n;C), define the Lie algebra \) of 
H to be the set of all matrices X such that 

e tx £ H 

for all real t. 



Definition 4.7. If G is a matrix Lie group with Lie algebra q, then H is a 
connected Lie subgroup of G if 

i) H is a subgroup of G 

ii) H is connected 

Hi) the Lie algebra t) of H is a subspace of Q 

iv) Every element of H can be written in the form e Xl e X2 ■ ■ ■ e Xn , with X\ , • • • , X n £ 



5. EXERCISES 



65 



Theorem 4.8. If G is a matrix Lie group with Lie algebra g, and H is a 
connected He subgroup of G, then the Lie algebra f) of H is a subalgebra of g. 

PROOF. Since by definition t) is a subspace of g, it remains only to show that 
\) is closed under brackets. So assume X, Y £ f). Then e tx and e sY are in H, and 
so (since H is a subgroup) is the element 

e tX e sY e ~tX = cxp [ s ( e tXy e -tX^ 

This shows that e tx Ye~ tx is in f) for all t. But f) is a subspace of g, which is 
necessarily a closed subset of q. Thus 

d 



^=dt 



tx tx (e hx Ye- hx -Y) 
e tx Ye -tx = Hm V L 

t=o h 



is in f). (This argument is precisely the one we used to show that the Lie algebra 
of a matrix Lie group is a closed under brackets, once we had established that it is 
a subspace.) □ 

We are now ready to state the main theorem of this section, which is our second 
major application of the Baker-Campbell-Hausdorff formula. 

THEOREM 4.9. Let G be a matrix Lie group with Lie algebra g. Let I) be a Lie 
subalgebra of g. Then there exists a unique connected Lie subgroup H of G such 
that the Lie algebra of H is (). 

Given a matrix Lie group G and a subalgebra f) of g, the associated connected 
Lie subgroup H might be a matrix Lie group. This will happen precisely if H 
is a closed subset of G. There are various conditions under which you can prove 
that H is closed. For example, if G = GL(n;C), and f) is semisimple, then H is 
automatically closed, and hence a matrix Lie group. (See Helgason, Chapter II, 
Exercises and Further Results, D.) 

If only the Baker-Campbell-Hausdorff formula worked globally instead of only 
locally the proof of this theorem would be easy. If the B-C-H formula converged 
for all X, Y we could just define H to be the image of t) under the exponential 
mapping. In that case B-C-H would show that this image is a subgroup, since then 

we would have e Hl e H2 = e z , with Z = H x + H 2 + \ [Hi, H 2 ] H G I) provided 

that Hi,H 2 G i). Unfortunately, the B-C-H formula is not convergent in general, 
and in general the image of H under the exponential mapping is not a subgroup. 

PROOF. Not written at this time. □ 

5. Exercises 

1. The center of a Lie algebra g is defined to be the set of all X e g such that 
[X, Y] = for all Y e g. Now consider the Heisenberg group 



with Lie algebra 
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Determine the center Z(t)) of f). For any X, Y S f), show that [X, Y] S 
Z(t)). This implies, in particular that both X and 1" commute with their 
commutator [X, Y] . 

Show by direct computation that for any X, Y G t), 



(4.15) e x e Y = e x+Y+ l^ Y \ 

2. Let X be a ji x n complex matrix. Show that 

I-e- x 



X 

is invertible if and only if X has no eigenvalue of the form A = 27rin, with n 
an non-zero integer. 

Hint: When is (1 — e~ z ) jz equal to zero? 



Remark: This exercise, combined with the formula in Theorem 4.5 



gives the following result (in the language of differentiable manifolds): The 
exponential mapping exp : g — > G is a local diffcomorphism near X € q if 
and only adA" has no eigenvalue of the form A = 2mn, with n a non-zero 
integer. 

Verify that the right side of the Baker-Campbell-Hausdorff formula (4.6) 
reduces to X + Y in the case that X and Y commute. 

Hint: Compute first e &dx e tadY ' {Y) and ( e ad * e tady - I) (Y). 
Compute log (e x e Y ) through third order in X/Y by using the power series 
for the exponential and the logarithm. Show that you get the same answer 
as the Baker-Campbell-Hausdorff formula. 

Using the techniques in Section |^, compute the series form of the Baker- 
Campbell-Hausdorff formula up through fourth-order brackets. (We have 
already computed up through third-order brackets.) 

Let a be a subalgebra of the Lie algebra of the Heisenberg group. Show that 
exp (a) is a connected Lie subgroup of the Heisenberg group. Show that in 
fact exp (a) is a matrix Lie group. 

Show that every connected Lie subgroup of SU (2) is closed. Show that this 
is not the case for SU (3). 



CHAPTER 5 



Basic Representation Theory 

1. Representations 

Definition 5.1. Let G be a matrix Lie group. Then a finite- dimensional 
complex representation of G is a Lie group homomorphism 

n : G — > GL(n; C) 

(n > 1) or more generally a Lie group homomorphism 

n : G — > GL(V) 

where V is a finite- dimensional complex vector space (with dim{V) > 1). A finite- 
dimensional real representation of G is a Lie group homomorphism II of G 
into GL(n;M) or into GL(V), where V is a finite- dimensional real vector space. 

If g is a real or complex Lie algebra, then a finite- dimensional complex 
representation of $ is a Lie algebra homomorphism ir of g into gl(n;C) or into 
gl(V), where V is a finite- dimensional complex vector space. If g is a real Lie 
algebra, then a finite- dimensional real representation of g is a Lie algebra 
homomorphism n of g into gl(n;R) or into g\(V). 

If U or tt is a one-to-one homomorphism, then the representation is called 
faithful. 

You should think of a representation as a (linear) action of a group or Lie 
algebra on a vector space. (Since, say, to every g g G there is associated an 
operator 11(5), which acts on the vector space V .) In fact, we will use terminology 
such as, "Let II be a representation of G acting on the space V." Even if g is a real 
Lie algebra, we will consider mainly complex representations of g. After making a 
few more definitions, we will discuss the question of why one should be interested 
in studying representations. 

Definition 5.2. Let H be a finite- dimensional real or complex representation 
of a matrix Lie group G, acting on a space V. A subspace W of V is called 
invariant ifU(A)w e W for all w e W and all A e G. An invariant subspace W 
is called non-trivial ifW^ {0} and W 7^ V . A representation with no non-trivial 
invariant subspaces is called irreducible. 

The terms invariant, non-trivial, and irreducible are defined analogously 
for representations of Lie algebras. 

Definition 5.3. Let G be a matrix Lie group, let II be a representation of 
G acting on the space V , and let E be a representation of G acting on the space 
W. A linear map (f> : V —> W is called a morphism (or intertwining map) of 

representations if 

4>(U(A)v) = E(A)0(w) 
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for all A G G and all v G V . The analogous property defines morphisms of repre- 
sentations of a Lie algebra. 

If $ is a morphism of representations, and in addition 4> is invertible, then 
4> is said to be an isomorphism of representations. If there exists an isomor- 
phism between V and W , then the representations are said to be isomorphic (or 
equivalent). 

Two isomorphic representations should be regarded as being "the same" rep- 
resentation. A typical problem in representation theory is to determine, up to 
isomorphism, all the irreducible representations of a particular group or Lie alge- 
bra. In Section 5.4 we will determine all the finite-dimensional complex irreducible 
representations of the Lie algebra su(2). 

Proposition 5.4. Let G be a matrix Lie group with Lie algebra g, and let II 
be a (finite- dimensional real or complex) representation of G, acting on the space 
V. Then there is a unique representation tt of g acting on the same space such that 

Tl(e x ) = 

for all X G g. The representation tt can be computed as 



n (e tx ) 



=0 



and satisfies 



tt (AX A' 1 ) = Il(A)Tr{X)IL(Ay 
for all X G g and all A G G. 



Proof. Theorem 3.18 in Chapter 3 states that for each Lie group homomor- 
phism cf> : G — ► H there is an associated Lie algebra homomorphism <f> '■ —> I)- 
Take H = GL(V) and cf> — II. Since the Lie algebra of GL(V) is gl(V) (since the ex- 
ponential of any operator is invertible) , the associated Lie algebra homomorphism 
<f> = n maps from g to gl(^), and so constitutes a representation of g. 

The properties of tt follow from the properties of <p given in Theorem 6. □ 

Proposition 5.5. Let g be a real Lie algebra, and gc its complexification. 
Then every finite-dimensional complex representation tt of g has a unique extension 
to a (complex-linear) representation of gc also denoted tt. The representation of 
gc satisfies 

TT(X +iY) = TT(X)+iTT(Y) 

for all X G g . 

PROOF. This follows from Exercise |l~4| of Chapter 3. □ 

Definition 5.6. Let G be a matrix Lie group, let H. be a Hilbert space, and 
let U(TL) denote the group of unitary operators on Ti. Then a homomorphism 
II : G — * U(H) is called a unitary representation of G ifH satisfies the following 
continuity condition: If A n , A G G and A n — > A, then 

U(A n )v -> U(A)v 

for all v G TL. A unitary representation with no non-trivial closed invariant sub- 
spaces is called irreducible. 



2. WHY STUDY REPRESENTATIONS? 
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This continuity condition is called strong continuity. One could require the 
even stronger condition that ||n(^4„) — 11(^4) | — > 0, but this turns out to be too 
stringent a requirement. (That is, most of the interesting representations of G will 
not have this stronger continuity condition.) In practice, any homomorphism of G 
into U{TL) you can write down explicitly will be strongly continuous. 

One could try to define some analog of unitary representations for Lie alge- 
bras, but there are serious technical difficulties associated with getting the "right" 
definition. 



2. Why Study Representations? 

If a representation II is a faithful representation of a matrix Lie group G, then 
{11(A) \A £ G} is a group of matrices which is isomorphic to the original group G. 
Thus II allows us to represent G as a group of matrices. This is the motivation for 
the term representation. (Of course, we still call II a representation even if it is not 
faithful.) 

Despite the origin of the term, the point of representation theory is not (at 
least in this course) to represent a group as a group of matrices. After all, all of 
our groups are already matrix groups! While it might seem redundant to study 
representations of a group which is already represented as a group of matrices, this 
is precisely what we are going to do. 

The reason for this is that a representation can be thought of (as we have 
already noted) as an action of our group on some vector space. Such actions (rep- 
resentations) arise naturally in many branches of both mathematics and physics, 
and it is important to understand them. 

A typical example would be a differential equation in three-dimensional space 
which has rotational symmetry. If the equation has rotational symmetry, then the 
space of solutions will be invariant under rotations. Thus the space of solutions will 
constitute a representation of the rotation group SO(3). If you know what all of the 
representations of SO(3) are, this can help immensely in narrowing down what the 
space of solutions can be. (As we will see, SO(3) has lots of other representations 
besides the obvious one in which SO (3) acts on R 3 .) 

In fact, one of the chief applications of representation theory is to exploit sym- 
metry. If a system has symmetry, then the set of symmetries will form a group, and 
understanding the representations of the symmetry group allows you to use that 
symmetry to simplify the problem. 

In addition, studying the representations of a group G (or of a Lie algebra q) 
can give information about the group (or Lie algebra) itself. For example, if G 
is a finite group, then associated to G is something called the group algebra. 
The structure of this group algebra can be described very nicely in terms of the 
irreducible representations of G. 

In this course, we will be interested primarily in computing the finite-dimensional 
irreducible complex representations of matrix Lie groups. As we shall see, this 
problem can be reduced almost completely to the problem of computing the finite- 
dimensional irreducible complex representations of the associated Lie algebra. In 
this chapter, we will discuss the theory at an elementary level, and will consider in 
detail the example of SO(3) and SU(2). In Chapter 6, we will study the represen- 
tations of SU(3), which is substantially more involved than that of SU(2), and give 
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an overview of the representation theory of a very important class of Lie groups, 
namely, the semisimple ones. 

3. Examples of Representations 

3.1. The Standard Representation. A matrix Lie group G is by definition 
a subset of some GL(n;M) or GL(n;C). The inclusion map of G into GL(n) (i.e., 
H(A) = A) is a representation of G, called the standard representation of G. 
Thus for example the standard representation of SO(3) is the one in which SO(3) 
acts in the usual way on M 3 . If G is a subgroup of GL(n;R) or GL(n;C), then its 
Lie algebra g will be a subalgebra of gl(n;K) or gl(n;C). The inclusion of g into 
gl(n;R) or gl(n;C) is a representation of g, called the standard representation. 

3.2. The Trivial Representation. Consider the one-dimensional complex 
vector space C. Given any matrix Lie group G, we can define the trivial repre- 
sentation of G, II : G — > GL(1; C), by the formula 

11(A) = I 

for all A G G. Of course, this is an irreducible representation, since C has no non- 
trivial subspaces, let alone non-trivial invariant subspaces. If g is a Lie algebra, we 
can also define the trivial representation of g, it : g — > gl(l; C), by 

k(X) = 

for all leg. This is an irreducible representation. 

3.3. The Adjoint Representation. Let G be a matrix Lie group with Lie 
algebra g. We have already defined the adjoint mapping 

Ad : G -» GL(fl) 

by the formula 

AdA(X) = AX A' 1 . 

Recall that Ad is a Lie group homomorphism. Since Ad is a Lie group homomor- 
phism into a group of invertible operators, we see that in fact Ad is a representation 
of G, acting on the space g. Thus we can now give Ad its proper name, the adjoint 
representation of G. The adjoint representation is a real representation of G. 
Similarly, if g is a Lie algebra, we have 

ad : g -> gl(g) 

defined by the formula 

adX(Y) = [X,Y]. 

We know that ad is a Lie algebra homomorphism (Chapter 3, Proposition 3.33| ), and 



is therefore a representation of g, called the adjoint representation. In the case 
that g is the Lie algebra of some matrix Lie group G, we have already established 
(Chapter 3, Proposition [3.21 and Exercise |l3|) that Ad and ad are related as in 



Proposition 5.4 



Note that in the case of SO(3) the standard representation and the adjoint 
representation are both three dimensional real representations. In fact these two 
representations are equivalent (Exercise |]) . 



3. EXAMPLES OF REPRESENTATIONS 
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3.4. Some Representations of SU(2). Consider the space V m of homoge- 
neous polynomials in two complex variables with total degree m (m > 0). That is, 
V m is the space of functions of the form 

(5.1) f(zi,Z 2 ) = a z™ + a 1 z™~ 1 z 2 + a 2 z™- 2 z\ ■■■ + a m z™ 

with Zi, z 2 € C and the dj's arbitrary complex constants. The space V m is an 
(m + l)-dimensional complex vector space. 

Now by definition an element U of SU(2) is a linear transformation of C 2 . Let 
z denote the pair z = (21,22) in C 2 . Then we may define a linear transformation 
II m (f7) on the space V m by the formula 



(5-2) [H m (U)f] (z) 

Explicitly, if / is as in (|5.l|), then 



KU-'z). 



[ILm(U)f] (z U Z 2 ) 



fc=0 



U^z 2 ) 



(U^zt + U 22 'z 2 ) 



By expanding out the right side of this formula we see that H m (U)f is again a 
homogeneous polynomial of degree m. Thus H m (U) actually maps V. m into V m . 
Now, compute 

n m pi) [n m (u 2 ) f] (z) = [n m (u 2 ) f] (u^z) = / (u^u^z) 
= n m {u l u 2 )f{z). 

Thus Ii m is a (finite-dimensional complex) representation of SU(2). (It is very 
easy to do the above computation incorrectly.) The inverse in definition ( |5.2| ) 
is necessary in order to make II m a representation. It turns out that each of 
the representations II m of SU(2) is irreducible, and that every finite-dimensional 
irreducible representation of SU(2) is equivalent to one (and only one) of the Ilm's. 
(Of course, no two of the II m 's are equivalent, since they don't even have the same 
dimension.) 

Let us now co mpu te the corresponding Lie algebra representation 7r m . Accord- 
can be computed as 

d 



ing to Proposition |5.4 



dt 



So 



(n m (X)f) (z) = - 



.-tx , 



t=o 



Now let z(t) be the curve in C defined as z(t) — e z, so that z(0) = z. Of 



course, z(t) can be written as z(t) 
rule, 



df dzi 
dz\ dt 



(zi(t), z 2 (t)), with Zi(t) G C. By the chain 
df dz 2 



+ 



dz 2 dt 



But dz/dt\ t _ = —Xz, so we obtain the following formula for 7r m (X) 

df df 
(5.3) 7T m (X)f = (X n zi + X 12 z 2 ) - -i- (X 21 zi + X 22 z 2 ) 



Now, according to Proposition 5.5, every finite-dimensional complex represen- 
tation of the Lie algebra su(2) extends uniquely to a complex-linear representation 
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of the complexification of su(2). But the complexification of su(2) is (isomorphic 
to) sl(2;C) (Chapter 3, Proposition |3.36|). To see that this is so, note that sl(2;C) 



is the space of all 2 x 2 complex matrices with trace zero. But if X is in sl(2; C), 
then 

X-X* X + X* X-X* X + X* 
X = + = + i- 



2 2 2 2i 

where both (X— X*)/2 and (X+X*)/2i are in su(2). (Check!) It is easy to see that 
this decomposition is unique, so that every X 6 sl(2;C) can be written uniquely as 
X = Xi+ iYi with X\,Y\ € su(2). Thus sl(2; C) is isomorphic as a vector space to 
su(2)c- But this is in fact an isomorphism of Lie algebras, since in both cases 

[X 1 + iY x ,X 2 + iY 2 \ = [X U X 2 ] - [Y U Y 2 ] + i {[X X ,Y 2 ] + [X 2 , y x ]) . 

(See Exercise ||.) 

So, the representation ir m of su(2) given by ([5.3]) extends to a representation 



of si (2; C), which we will also call 7r m . I assert that in fact formula (5.3), still holds 
for X e sl(2;C). Why is this? Well, ( ^3| ) is undoubtedly (complex) linear, and it 
agrees with the original 7r m for X <G su(2). But there is only one complex linear 
extension of 7r m from su(2) to sl(2;C), so this must be it! 
So, for example, consider the element 



1 

-1 



H = 

in the Lie algebra sl(2;C). Applying formula ( |5.3| ) gives 

(n rn (H)f) (z) = ~ J-^i + §~ Z2 ' 

Thus we see that 

d d 

(5.4) 7 r m (H) = -z 1 — + z 2 — . 

oz\ az 2 

Applying Tr m (H) to a basis element z\z™~ k we get 

Tx m {H)z\z^ k = ~kz k z™- k + (m - k)z\z™- k = (m - 2k)z k z 7 2 n - k . 

Thus z\z™~ k is an eigenvector for ir m {H) with eigenvalue (to — 2k). In particular, 
ir m (H) is diagonalizable. 

Let X and Y be the elements 

X=l! M; Y= fQ 



oo/' V i o 



in sl(2;C). Then (^ tells us that 



Tr m (X) = -z 21 g-; n m (Y) = ~ Zl A- 



so that 

TT m (X) Z k zlJ l - k = ~kz k ~ X Z™- k+1 

(5.5) n m (Y)z k z™- k = (k - m)z k+1 z r 2 n - k - 1 . 

Proposition 5.7. The representation ir m is an irreducible representation of 



sl(2;( 
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Proof. It suffices to show that every non-zero invariant subspace of V m is in 
fact equal to V m . So let W be such a space. Since W is assumed non-zero, there is 
at least one non-zero element w in W. Then w can be written uniquely in the form 

w = clqz™ + aiz™ _1 z 2 + a 2 z™~ 2 z 2 h a m z™ 

with at least one of the a^'s non-zero. Let ko be the largest value of k for which 
afc 7^ 0, and consider 

n m {X) k °w. 



Since (by (5.5)) each application of ir m (X) lowers the power of Z\ by 1, ir m (X) k ° 
will kill all the terms in w whose power of Z\ is less than ko, that is, all except the 
a,k z i° z™~ k ° term. On the other hand, we compute easily that 



ir m (X) ko (a ko z^z^- ko ) = k \(-l) k <>a ko z r 2 



We see, then, that 7T m (X) k °w is a non-zero multiple of z™ . Since W is assumed 
invariant, W must contain this multiple of z™, and so also z™ itself. 

But now it follows from ( |5.5| ) that ir m (Y) k z™ is a non-zero multiple of z\ z 2 n ~ k . 
Therefore W must also contain z k z™~ k for all < k < m. Since these elements 
form a basis for V m , we see that in fact W — V m , as desired. □ 

3.5. Two Unitary Representations of SO(3). Let H = L 2 (R 3 ,dx). For 
each R € SO (3), define an operator ili(i?) on Ti by the formula 

p!(iZ)/] (x) = f(R- 1 x). 

Since Lebesgue measure dx is rotationally invariant, n 1 (i?) is a unitary operator 
for each R 6 SO (3). The calculation of the previous subsection shows that the 
map R — > is a homomorphism of SO (3) into U(H). This map is strongly 

continuous, and hence constitutes a unitary representation of SO(3). 

Similarly, we may consider the unit sphere S 2 C K 3 , with the usual surface 
measure ft. Of course, any R 6 SO(3) maps S 2 into S 2 . For each R we can define 
n 2 (i?) acting on L 2 (S 2 , dft) by 

p a (iZ)/] (a:) - / (R-'x) . 

Then II2 is a unitary representation of SO(3). 

Neither of the unitary representations ill and II2 is irreducible. In the case of 
n 2 , L 2 (S 2 ,dfl) has a very nice decomposition as the orthogonal direct sum of finite- 
dimensional invariant subspaces. This decomposition is the theory of "spherical 
harmonics," which are well known in the physics (and mathematics) literature. 

3.6. A Unitary Representation of the Reals. Let H = L 2 (R,dx). For 
each net, define T a : H — ► Ti by 

(T a f) (x) = f(x-a). 

Clearly T a is a unitary operator for each a € K, and clearly T a T\, — T a+ t- The 
map a — > T a is strongly continuous, so T is a unitary representation of R. This 
representation is not irreducible. The theory of the Fourier transform allows you 
to determine all the closed, invariant subspaces of Ti (W. Rudin, Real and Complex 
Analysis, Theorem 9.17). 
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3.7. The Unitary Representations of the Real Heisenberg Group. 

Consider the Heisenberg group 
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Now consider a real, non-zero constant, which for reasons of historical convention 
we will call h ( "aitch-bar" ) . Now for each H € R\{0}, define a unitary operator lift 
on L 2 (R,dx) by 

/ 1 a b \ 

(5.6) n R 1 c } f = e- lhb e thcx f(x-a). 

V o 1 J 

It is clear that the right side of (5.6) has the same norm as /, so lift is indeed 
unitary. 

Now compute 

n,, 

= e -ihb e iK~cx e -ihb e ihc(x-a) _ ~ _ ^ 

= e -ih(b+b+ca) e ih(Z+c)xf ( x _@ + a )) . 

This shows that the map A — ► lift (A) is a homomorphism of the Heisenberg group 
into U (L 2 (M)). This map is strongly continuous, and so lift is a unitary represen- 
tation of H . 

Note that a typical unitary operator H%{A) consists of first translating /, then 
multiplying / by the function e lhcx i and then multiplying / by the constant e~ lhh . 
Multiplying / by the function e lhcx has the effect of translating the Fourier trans- 
form of /, or in physical language, "translating / in momentum space." Now, 
if Ui is an ordinary translation and U2 is a translation of the Fourier transform 
(i.e., U2 = multiplication by some e thcx ), then U\ and U2 will not commute, but 
U1U2U1 U% will be simply multiplication by a constant of absolute value one. 
Thus {lift (A) \A 6 H } is the group of operators on L 2 (R) generated by ordinary 
translations and translations in Fourier space. It is this representation of the Heisen- 
berg group which motivates its name. (See also Exercise |l0| .) 

It follows fairly easily from standard Fourier transform theory (e.g., W. Rudin, 
Real and Complex Analysis, Theorem 9.17) that for each h S K\{0} the repre- 
sentation is irreducible. Furthermore, these are (up to equivalence) almost all 
of the irreducible unitary representations of H . The only remaining ones are the 
one-dimensional representations Tl a ,/3 

b \ 

_ e i(aa+l3c)j 




with a,|3 6 R. (The Il a ^'s are the irreducible unitary representations in which 
the center of H acts trivially.) The fact that n^'s and the n Qi( g's are all of the 
(strongly continuous) irreducible unitary representations of H is closely related 
to the celebrated Stone- Von Neumann theorem in mathematical physics. See, for 
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example, M. Reed and B. Simon, Methods of Modern Mathematical Physics, Vol. 
3, Theorem XL 84. See also Exercise 

4. The Irreducible Representations of su(2) 

In this section we will compute (up to equivalence) all the finite-dimensional 
irreducible complex representations of the Lie algebra su(2). This computation is 
important for several reasons. In the first place, su(2) = so(3), and the repre- 
sentations of so(3) are of physical significance. (The computation we will do here 
is found in every standard textbook on quantum mechanics, under the heading 
"angular momentum.") In the second place, the representation theory of su(2) is 
an illuminating example of how one uses commutation relations to determine the 
representations of a Lie algebra. In the third place, in determining the represen- 
tations of general semisimple Lie algebras (Chapter 6), we will explicitly use the 
representation theory of su(2). 

Now, every finite-dimensional complex representation 7r of su(2) extends by 



Prop. 5.5 to a complex-linear representation (also called tt) of the complexification 



of su(2), namely si (2; C). 

Proposition 5.8. Let tt be a complex representation o/su(2), extended to a 
complex-linear representation o/sl(2;C). Then n is irreducible as a representation 
o/su(2) if and only if it is irreducible as a representation o/sl(2;C). 

Proof. Let us make sure we are clear about what this means. Suppose that 
7T is a complex representation of the (real) Lie algebra su(2), acting on the com- 
plex space V . Then saying that 7r is irreducible means that there is no non-trivial 
invariant complex subspace W C V. That is, even though su(2) is a real Lie alge- 
bra, when considering complex representations we are interested only in complex 
invariant subspaces. 

Now, suppose that tt is irreducible as a representation of su(2). If W is a 
(complex) subspace of V which is invariant under sl(2;C), then certainly W is 
invariant under su(2) C sl(2;C). Therefore W = {0} or W = V. Thus 7r is 
irreducible as a representation of sl(2; C). 

On the other hand, suppose that 7r is irreducible as a representation of si (2; C), 
and suppose that W is a (complex) subspace of V which is invariant under su(2). 
Then W will also be invariant under ir(X+iY) = Tr(X)+iTr(Y), for all X, Y G su(2). 
Since every element of si (2; C) can be written as X + iY, we conclude that in fact 
W is invariant under sl(2;C). Thus W = {0} or W = V, so tt is irreducible as a 
representation of su(2). □ 

We see, then that studying the irreducible representations of su(2) is equivalent 
to studying the irreducible representations of sl(2;C). Passing to the complexified 
Lie algebra makes our computations easier. 

We will use the following basis for sl(2; C): 

H=ll ° V X =(° n M : Y f ° ° 



-1 /' 10 0/' I 1 o 

which have the commutation relations 

[H, X] = 2X 

[H, Y] = -2Y 

[X,Y] = H 
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If V is a (finite-dimensional complex) vector space, and A, B, and C are operators 
on V satisfying 

[A, B] = 2B 
[A, C] = -2C 
[B,C] = A 

then because of the skew-symmetry and bilinearity of brackets, the linear map 
tt : sl(2;C) -> gl(V) satisfying 

Tr(-ff) = A; n(X) = B; n(Y) = C 

will be a representation of si (2; C). 

Theorem 5.9. For each integer m > 0, there is an irreducible representation 
o/sl(2; C) with dimension m+1. Any two irreducible representations o/sl(2; C) with 
the same dimension are equivalent. If tt is an irreducible representation o/sl(2;C) 
with dimension m+1, then n is equivalent to the representation 7r m described in 
Section [|. 

Proof. Let tt be an irreducible representation of sl(2;C) acting on a (finite- 
dimensional complex) space V. Our strategy is to diagonalize the operator tt(H). 
Of course, a priori, we don't know that tt(H) is diagonalizable. However, because 
we are working over the (algebraically closed) field of complex numbers, tt(H) must 
have at least one eigenvector. □ 

Proof. The following lemma is the key to the entire proof. 

Lemma 5.10. Let u be an eigenvector of tt(H) with eigenvalue a € C. Then 

tt(H)tt(X)u = {a + 2)n(X)u. 

Thus either it(X)u = 0, or else tt(X)u is an eigenvector for ir(H) with eigenvalue 
a + 2 . Similarly, 

tt(H)tt(Y)u = (a- 2)7r(Y)u 

so that either tt(Y)u — 0, or else tt(Y)u is an eigenvector for Tt(H) with eigenvalue 
a-2. 

Proof. We call tt(X) the "raising operator," because it has the effect of raising 
the eigenvalue of n(H) by 2, and we call n(Y) the "lowering operator." We know 
that [ir(H),ir(X)} =ir([H,X]) = 2ir(X). Thus 

tt(H)tt(X) - tt(X)tt(H) = 2n(X) 

or 

tt(H)tt(X) = tt{X)tt(H) + 2tt{X). 

Thus 

n(H)ir{X)u = 7r(X)n(H)u + 2tt{X)u 
= tt{X) {cm) + 2tt{X)u 
= (a + 2)ir(X)u. 

Similarly, [w(H),n(Y)] = -2ir(Y), and so 

7r(ff)7r(T) = tt(Y)t:(H) - 2n(Y) 
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SO that 

w(H)n(Y)u = n(Y)n(H)u - 2n(Y)u 
= n(Y) (au) - 2n(Y)u 
= (a-2)n(Y)u. 

This is what we wanted to show. □ 

As we have observed, tt(H) must have at least one eigenvector u (u ^ 0), with 
some eigenvalue a € C. By the lemma, 

ir(H)ir(X)u = (a + 2)n{X)u 

and more generally 

ir(H)ir(X) n u = (a + 2n)ir(X) n u. 

This means that either Tr(X) n u = 0, or else ir(X) n u is an eigenvector for n(H) with 
eigenvalue (a + 2n) . 

Now, an operator on a finite-dimensional space can have only finitely many 
distinct eigenvalues. Thus the 7r(X)™u's cannot all be different from zero. Thus 
there is some N >Q such that 

k{X) n u^0 

but 

tt(X) n+1 u = Q. 
Define u = ir(X) N u and A = a + 2N. Then 

(5.7) ir(H)u = Xu a 

(5.8) Tr{X)u a = 
Then define 

U k = TT(Y) k U 

for k > 0. By the second part of the lemma, we have 

(5.9) Tr(H)u k = (\-2k)u k . 

Since, again, n(H) can have only finitely many eigenvalues, the u^s cannot all be 
non-zero. 

Lemma 5.11. With the above notation, 

TT{X)u k = [kX-k(k-l)]u k -i (fc>0) 

n(X)u = 0. 

PROOF. We proceed by induction on k. In the case k = 1 we note that u\ = 
7r(Y)uo. Using the commutation relation [ir(X), n(Y)] = n(H) we have 

tt(X) Ui - 7r(X)7r(Y)u = {tt(Y)tt(X) + n(H)) u . 

But n(X)uo — 0, so we get 

n(X)ui = Xu 
which is the lemma in the case k = 1. 
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Now, by definition u k+ i — ir(Y)u k . Using ( j5j| ) and induction we have 

Tr(X)u k+1 = Tr(X)n(Y)u k 
= {ir(Y)w(X) + n(H)) u k 
= tt(Y) [kX - k(k - 1)] u k -i + (A — 2k)u k 
= [kX - k(k - 1) + (A - 2k)] u k . 
Simplifying the last expression give the Lemma. □ 

Since tt(H) can have only finitely many eigenvalues, the u k s cannot all be 
non-zero. There must therefore be an integer to > such that 

U k = 7T(y) fe 71 + 



for all k < m, but 



.1 = n(Y) m+1 u = 0. 



Now if u m+ i = 0, then certainly ir(X)u m+ i = 0. Then by Lemma 5.11 

= ir(X)u rn+ i = [(to + 1)A — m(m + 1)] u m = (to + 1)(A — m)u m . 

But u m 0, and m + 1 ^ (since to > 0). Thus in order to have (m+l)(A — m)u m 
equal to zero, we must have A = to. 

We have made considerable progress. Given a finite-dimensional irreducible 
representation n of si (2; C), acting on a space V, there exists an integer to > and 
non-zero vectors Uo, ■ ■ • u m such that (putting A equal to to) 

Tr(H)u k = (to - 2k)u k 

ir{Y)u k = u k+ i (k < to) 

n{Y)u m = 

Tr(X)u k = [km - k(k - 1)] u fe _i (k > 0) 
(5.10) n{X)u = 

The vectors uq, ■ • ■ u m must be linearly independent, since they are eigenvectors 
of tt(H) with distinct eigenvalues. Moreover, the (to + l)-dimensional span of 
uo,---u m is explicitly invariant under tv(H), 7r(X), and tt(Y), and hence under 
n(Z) for all Z £ sl(2;C). Since tt is irreducible, this space must be all of V. 

W e ha ve now shown that every irreducible representation of sl (2;C) is of the 
form ( l5.10| ). It remains to show that everything of the form ( 5.10 ) is a represen- 



tati on, a nd that it is irreducible. That is, if we define tt(H), k(X), and ir(Y) 
by (M0|) (where the Wfc's are basis elements for some (to + l)-dimensional vector 



space), then we want to show that they have the right commutation relations to 
form a representation of si (2; C), and that this representation is irreducible. 

The computation of the commutation relations of tt(H), tt(X), and tt(Y) is 
straightforward, and is left as an exercise. Note that when dealing with tt(Y), you 
should treat separately the vectors u k , k < to, and u m . Irreducibility is also easy 



to check, by imitating the proof of Proposition 5.7. (See Exercise g.) 

We have now shown that there is an irreducible representation of sl(2;C) in 
each dimension m + 1, by explicitly writing down how H, X, and Y should act 



(Equation 5.10) in a basis. But we have shown more than this. We also have 



shown that any (to + l)-dimensional irreducible representation of sl(2;C) must be 



of the form (5.10). It follows that any two irreducible representations of sl(2;C) 
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of dimension (m +1) must be equivalent. For if 7Ti and 772 are two irreducible 
representations of dimension (m +1), acting on spaces V\ and V2, then V\ has a 
basis mq, • • - Mm as in ( 5.1 0| ) and V2 has a similar basis mq, • • -u m . But then the 



map 4> : Vi — > V2 which sends Uk to Uk will be an isomorphism of representations. 
(Think about it.) 

In particular, the (m+ l)-dimensional representation 7r m described in Section^ 
must be equivalent to flB.lOp .This can be seen explicitly by introducing the following 
basis for V m : 

u k = [n m (Y)] k (z?) = (-l)^-^— z ^- k (k < m). 

[m — k)\ 

Then by definition ir m {Y)uk = w/c+i (k < m), and it is clear that ir m (Y)u m = 0. 
It is easy to see that Tr m (H)uk = (m — 2k)uk- The only thing left to check is the 
behavior of Tr m (X). But direct computation shows that 

TT m (X)uk — k(m — k + l)uk-i — [km — k(k — 1)] u^-i- 

as required. 



This completes the proof of Theorem 5.9. □ 



5. Direct Sums of Representations and Complete Reducibility 

One way of generating representations is to take some representations you know 
and combine them in some fashion. We will consider two methods of generating 
new representations from old ones, namely direct sums and tensor products of 
representations. In this section we consider direct sums; in the next section we look 
at tensor products. (There is one other standard construction of this sort, namely 
the dual of a representation. See Exercise pr[ ) 

Definition 5.12. Let G be a matrix Lie group, and let ili,ri2,- • - Ii n be rep- 
resentations of G acting on vector spaces V\, V2, • • • V n . Then the direct sum of 
III, II2, ■ • • n n is a representation Hi®- ■ -®II„ of G acting on the space V\®- ■ -®V n , 
defined by 

pi e • • • © n n (A)] («!,••■«„) = {Hi(A)vi,--- ,u n (A)v n ) 

for all AeG. 

Similarly, if q is a Lie algebra, and 7Ti , 772 , • • • n n are representations of Q acting 
on Vi, V2, ■ ■ ■ V n , then we define the direct sum of tt\, 772, • • • 7T n , acting on V\ © 
"•®V n by 

[77i©---©77„pO] (v ir --v n ) = (ri{X)vu-'- ,ir n (X)v n ) 
for all X E g. 

It is trivial to check that, say, IT © • • • © II„ is really a representation of G. 

Definition 5.13. A finite- dimensional representation of a group or Lie alge- 
bra, acting on a space V , is said to be completely reducible if the following 
property is satisfied: Given an invariant subspace W <ZV , and a second invariant 
subspace U C W C V , there exists a third invariant subspace U C W such that 
U(1U = {0} andU + U = W. 

The following Proposition shows that complete reducibility is a nice property 
for a representation to have. 



so 
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Proposition 5.14. A finite- dimensional completely reducible representation of 
a group or Lie algebra is equivalent to a direct sum of (one or more) irreducible 
representations. 

Proof. The proof is by induction on the dimension of the space V. If dim V — 
1, then automatically the representation is irreducible, since then V is has no non- 
trivial subspaces, let alone non-trivial invariant subspaces. Thus V is a direct sum 
of irreducible representations, with just one summand, namely V itself. 

Suppose, then, that the Proposition holds for all representations with dimension 
strictly less than n, and that dim V = n. If V is irreducible, then again we have 
a direct sum with only one summand, and we are done. If V is not irreducible, 
then there exists a non-trivial invariant subspace U C V. Taking W = V in the 
definition of complete reducibility, we see that there is another invariant subspace 
U with U n U = {0} and U + U = V. That is, V = U U as a vector space. 

But since U and U are invariant, they can be viewed as representations in 
their own right. (That is, the action of our group or Lie algebra on U or U is 
a representation.) It is easy to see that in fact V is isomorphic to U ffi U as a 
representation. Furthermore, it is easy to see that both U and U are completely 
reducible representations, since every invariant subspace W of, say, U is also an 
invariant subspace of V. But since U is non-trivial (i.e., U ^ {0} and U ^ V), 
we have dim U < dim V and dim U < dim V. Thus by induction U = U\ • • • U n 
(as representations), with the Ui's irreducible, and U = Ui ® ■ ■ ■ U m , with the E/j's 
irreducible, so that V = Ux © • • • U n 8 U\ ® ■ ■ ■ U m . □ 

Certain groups and Lie algebras have the property that every (finite-dimensional) 
representation is completely reducible. This is a very nice property, because it im- 
plies (by the above Proposition) that every representation is equivalent to a direct 
sum of irreducible representations. (And, as it turns out, this decomposition is 
essentially unique.) Thus for such groups and Lie algebras, if you know (up to 
equivalence) what all the irreducible representations are, then you know (up to 
equivalence) what all the representations are. 

Unfortunately, not every representation is irreducible. For example, the stan- 
dard representation of the Heisenberg group is not completely reducible. (See Ex- 
ercise |[) 

Proposition 5.15. Let G be a matrix Lie group. Let II be a finite- dimensional 
unitary representation of G, acting on a finite- dimensional real or complex Hilbert 
space V . Then LT is completely reducible. 

Proof. So, we are assuming that our space V is equipped with an inner prod- 
uct, and that 11(A) is unitary for each A S G. Suppose that W C V is invariant, 
and that U C W C V is also invariant. Define 

u = u x n w. 

Then of course U D U = {0}, and standard Hilbert space theory implies that 

u + u = w. 

It remains only to show that U is invariant. So suppose that v S U n W. 
Since W is assumed invariant, H(A)v will be in W for any A S G. We need to 
show that F±(v4)i; is perpendicular to U. Well, since iI(A _1 ) is unitary, then for any 
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u G U 

(u,il(A)v) = (n(^- 1 )u,n(A- 1 )n(A)w) = (n( J 4- 1 )w,w) . 

But U is assumed invariant, and so G J7. But then since u € 17 , 

<n(^- 1 )u,v) = 0. This means that 

(u,U(A)v) = 

for all u eU, i.e., II (A) u G U^. 

Thus f7 is invariant, and we are done. □ 



Proposition 5.16. If G is a finite group, then every finite- dimensional real or 
complex representation of G is completely reducible. 

Proof. Suppose that IT is a representation of G, acting on a space V. Choose 
an arbitrary inner product ( ) on V. Then define a new inner product ( ) G on V 

by 

(vi,v 2 ) G = ^2 (U(g)vi,U(g)v 2 ) ■ 
g€G 

ft is very easy to check that indeed ( ) G is an inner product. Furthermore, if h G G, 
then 

(U(h) Vl ,U(h)v 2 ) G = (U(g)Il(h)v u U(g)U(h)v 2 ) 
g&G 

= J2(n(9h)v U Il(9h)v 2 ). 

gee 

But as g ranges over G, so does gh. Thus in fact 

(R(h)vi,n{h)v 2 ) G = (vi,v 2 ) G . 

That is, IT is a unitary representation with respect to the inner product () G - 
Thus II is completely reducible by Proposition |5.15 . □ 



There is a variant of the above argument which can be used to prove the 
following result: 

Proposition 5.17. If G is a compact matrix Lie group, then every finite- 
dimensional real or complex representation of G is completely reducible. 

Proof. This proof requires the notion of Haar measure. A left Haar mea- 
sure on a matrix Lie group G is a non-zero measure \x on the Borel c-algebra in 
G with the following two properties: 1) it is locally finite, that is, every point in 
G has a neighborhood with finite measure, and 2) it is left-translation invariant. 
Left-translation invariance means that /i (gE) — /x (E) for all g G G and for all 
Borel sets E c G, where 

gE = {ge\eeE}. 

It is a fact which we cannot prove here that every matrix Lie group has a left Haar 
measure, and that this measure is unique up to multiplication by a constant. (One 
can analogously define right Haar measure, and a similar theorem holds for it. Left 
Haar measure and right Haar measure may or may not coincide; a group for which 
they do is called unimodular.) 
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Now, the key fact for our purpose is that left Haar measure is finite if and 
only if the group G is compact. So if II is a finite-dimensional representation of a 
compact group G acting on a space V, then let ( ) be an arbitrary inner product 
on V, and define a new inner product ( ) G on V by 

(vi,v 2 ) G = (II(>)ui,n(>)u2) dfi(g), 
Jg 

where [i is left Haar measure. Again, it is easy to check that ( ) G is an inner 
product. Furthermore, if h G G, then by the left-invariance of \i 

(u(h)v 1 ,n(h)v 2 ) G = [ (iiigMh^MgMh^) d^( g ) 

Jg 



{U{gh)v u U{gh)v 2 ) dfx(g) 

IG 

= (vi,v 2 ) G - 

So II is a unitary representation with respect to ( ) G , and thus completely reducible. 
Note that ( ) G is well-defined only because /i is finite. □ 

6. Tensor Products of Representations 

Let U and V be finite-dimensional real or complex vector spaces. We wish to 
define the tensor product of U and V, which is will be a new vector space U <S)V 
"built" out of U and V. We will discuss the idea of this first, and then give the 
precise definition. 

We wish to consider a formal "product" of an element u of U with an element 
v of V, denoted u®v. The space U <g> V is then the space of linear combinations of 
such products, i.e., the space of elements of the form 

(5.11) aiUi <S> i»i + a 2 u 2 ® v 2 + ■ ■ ■ + a n u n (g> v n . 

Of course, if "®" is to be interpreted as a product, then it should be bilinear. That 
is, we should have 

(u\ + au 2 ) eg) v = ui ® v + au 2 ® v 
u ® (vi + av 2 ) = u ($ Vi + au ® v 2 - 

We do not assume that the product is commutative. (In fact, the product in the 
other order, v ® u, is in a different space, namely, V <g> U.) 

Now, if ei, e 2 , ■ ■ ■ , e n is a basis for U and ft, f 2 , ■ ■ ■ , f m is a basis for V, then 



using bilinearity it is easy to see that any element of the form (5.11) can be written 
as a linear combination of the elements ej ® fj ■ In fact, it seems reasonable to 
expect that {e^ ® fj |0 < i < n, < j < m} should be a basis for the space U ®V. 
This in fact turns out to be the case. 

Definition 5.18. If U and V are finite- dimensional real or complex vector 
spaces, then a tensor product of U with V is a vector space W , together with a 
bilinear map <fi : U x V — > W with the following property: If ip is any bilinear map 
ofUxV into a vector space X, then there exists a unique linear map ipofW into 
X such that the following diagram commutes: 

U x V ^ W 
X 
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Note that the bilinear map ip from U x V into X turns into the linear map ip 
of W into X. This is one of the points of tensor products: bilinear maps on U x V 
turn into linear maps on W . 

Theorem 5.19. IfU and V are any finite- dimensional real or complex vector 
spaces, then a tensor product (W, <p) exists. Furthermore, (W, </>) is unique up to 
canonical isomorphism. That is, if (Wi,4>i) and (W2,4>2) are two tensor products, 
then there exists a unique vector space isomorphism <I> : W\ — > W% such that the 
following diagram commutes 

U x V ^ W x 
02 \ / $ • 

W 2 

Suppose that (W, (p) is a tensor product, and that ei,e2,--- , e„ is a basis for 
U and fi, fi-, • • • , fm is a basis for V . Then {<p(ei, fj) |0 < i < n,0 < j < m} is a 
basis for W . 

Proof. Exercise |l2|. □ 

Notation 5.20. Since the tensor product ofU and V is essentially unique, we 
will letU ®V denote an arbitrary tensor product space, and we will write u®v in- 
stead oftf)(u, v). In this notation, the Theorem says that {ei ® fj |0 < i < n, < j < m} 
is a basis for U (£> V , as expected. Note in particular that 

dim (U <8> V) = (dim U) (dim V) 

(not i\m.U + dim V). 

The defining property of U ® V is called the universal property of tensor 
products. While it may seem that we are taking a simple idea and making it 
confusing, in fact there is a point to this universal property. Suppose we want to 
define a linear map T from U ® V into some other space. The most sensible way to 
define this is to define T on elements of the form u ® v. (You might try defining it 
on a basis, but this forces you to worry about whether things depend on the choice 
of basis.) Now, every element of £7(8 V is a linear combination of things of the form 

However, this representation is far from unique. (Since, say, if u = Ui + 
then you can rewrite u ® v as u\ ® v + u 2 ® v.) 

Thus if you try to define T by what it does to elements of the form u (8 v, 
you have to worry about whether T is well-defined. This is where the universal 
property comes in. Suppose that ip( u T v ) is some bilinear expression in u, v. Then 
the universal property says precisely that there is a unique linear map T (= 
such that 

T(u ® v) = tp(u, v). 

(Think about it and make sure that you see that this is really what the universal 
property says.) 

The conclusion is this: You can define a linear map T on U (8 V by defining it 
on elements of the form u®v, and this will be well-defined, provided that T(u (8 v) 
is bilinear in (u, v). The following Proposition shows how to make use of this idea. 
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Proposition 5.21. Let U and V be finite- dimensional real or complex vector 
spaces. Let A : U — > U and B : V — > V be linear operators. Then there exists a 
unique linear operator from U ®V to U ®V , denoted A® B, such that 

A <g> B{u ®v) = (Au) <g> (Bv) 

for all u eU, v g V. 

If Ai,A 2 are linear operators on U and Bi,B 2 are linear operators on V, then 

(A 1 ® B x ) (A 2 ® B 2 ) = (A X A 2 ) <8> (BiB 2 ) • 

Proof. Define a map t/> from J7 x V into U ® V by 

ip(u,v) = (Au) (g) . 

Since ^4 and B are linear, and since £g> is bilinear, r/> will be a bilinear map of U x V 
into 17 <g> V. But then the universal property says that there is an associated linear 
map $ :U ® V — > U <g> V such that 

ij){u ® v) = tp(u, v) — (Au) ® (Bv) . 

Then r/> is the desired map A ® B. 

Now, if Ai,A 2 are operators on ?7 and Bi,B 2 are operators on V, then compute 

that 

(Ai <g> Bi) (A 2 ® B 2 ) (u ® v) = (Ai ® Si) (A 2 u ® B 2 w) 

= A\A 2 u ® B\B 2 v. 

This shows that (Ai ® Bi) (A 2 <8> B 2 ) = (^4i^4 2 ) <8> (B\B 2 ) are equal on elements of 
the form u(gw. Since every element of U®V can be written as a linear combination of 
things of the form u®v (in fact of e^fj), (At <g> Bi) (A 2 ® B 2 ) and (A 1 A 2 )®(B 1 B 2 ) 
must be equal on the whole space. □ 

We are now ready to define tensor products of representations. There arc two 
different approaches to this, both of which are important. The first approach starts 
with a representation of a group G acting on a space V and a representation of 
another group H acting on a space U, and produces a representation of the product 
group G x H acting on the space U ® V . The second approach starts with two 
different representations of the same group G, acting on spaces U and V, and 
produces a representation of G acting on U ® V. Both of these approaches can be 
adapted to apply to Lie algebras. 

Definition 5.22. Let G and H be matrix Lie groups. Let Hi be a represen- 
tation of G acting on a space U and let U 2 be a representation of H acting on a 
space V. The the tensor product of P±i and H 2 is a representation Ui ® n 2 of 
G x H acting on U ®V defined by 

n x ® n 2 (A, b) = iii (A) ® n 2 (b) 

for all Ae G and B g H. 

Using the above Proposition, it is very easy to check that indeed III ® n 2 is a 
representation of G x H. 

Now, if G and H are matrix Lie groups, that is, G is a closed subgroup of 
GL(n; C) and H is a closed subgroup of GL(m; C), then G x H can be regarded in 
an obvious way as a closed subgroup of GL(n + to; C). Thus the direct product of 
matrix Lie groups can be regarded as a matrix Lie group. It is easy to check that 
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the Lie algebra of G x H is isomorphic to the direct sum of the Lie algebra of G 
and the Lie algebra of H. See Exercise 

In light of Proposition 5.4, the representation 111 £g> I±2 of G x H gives rise to a 
representation of the Lie algebra of G x H, namely 0© f). The following Proposition 
shows that this representation of g ffi [) is not what you might expect at first. 



Proposition 5.23. LetG and H be matrix Lie groups, let Hi, H2 be represen- 
tations of G, H respectively, and consider the representation III <g) II2 of G x H . Let 
7Ti <g> 7T2 denote the associated representation of the Lie algebra of G x H , namely 
g © f) . Then for all X e g and Y G f) 

7Ti (g>7T 2 (X,y) =7ri(X)(g)/ + /(g)7r 2 (r). 

Proof. Suppose that u(t) is a smooth curve in U and is a smooth curve 
in V. Then we verify the product rule in the usual way: 



lim 



u(t + h)® v(t + h) - u(t) <g> v(t) 



u(t + h) <g> v(t + h) - u(t + h)® v(t) u{t + h)® v(t) - u(t) <g> v(t) 
lim 1 



lim 



u(t + h) 



(v(t + h) -v(t)) 



+ lim 

h— >Q 



(u(t + /l) - « (t)) 



® u(i) 



Thus 



d , , . ... , . . . dv 

— (u(t) «(*)) = — <g) w(t) + u(t) ® — . 



This being the case, we can compute 7Ti ® 7r 2 (X, Y): 



7Ti 7r 2 (X,F)(u(g)u) = — 
at 



Ui(E>n 2 (e tx ,e tY )(u(E)v) 



t=o 
di 



ni(e tx )u® n 2 (e 



Tli(e tJi )u 



t=o 



U + " d£ 



n 2 ( e "> 



t=0 



This shows that ttl <£> tt 2 (X, Y~) = tti(X) ® L + L <S> 7r 2 (F) on elements of the form 
u<8>v, and therefore on the whole space U V. □ 



Definition 5.24. Let g and f) be Lie algebras, and let tti and it2 be represen- 
tations of g and f), acting on spaces U and V. Then the tensor product ofiti and 
7T2, denoted tti ® 7r 2; is a representation of g © t) acting on U ®V , given by 

7Ti <g)7T 2 (X,y) = 7Ti(X) <g> 7 + 7® 7T 2 (F) 

/or all X E g and Y E t). 

It is easy to check that this indeed defines a representation of g © f). Note that 
if we defined 7Ti ® tt 2 (X, F) = 7Ti(X) C3>7r 2 (Y), this would not be a representation of 
gffif), for this is not even a linear map. (E.g., we would then have 7ri<g)7r 2 (2X, 2Y) = 
Atti <g) 7r 2 (X, Y)l) Note also that the above definition applies even if iri and 7T2 do 
not come from a representation of any matrix Lie group. 
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Definition 5.25. Let G be a matrix Lie group, and let IT and n 2 be repre- 
sentations of G, acting on spaces V\ and V2. Then the tensor product 0/II1 and 
il 2 is a representation of G acting on V\ <8> V2 defined by 

ni®n 2 (A) =n 1 (A)®n a (A) 

for all AeG. 



Proposition 5.26. With the above notation, the associated representation of 
the Lie algebra g satisfies 

7T1 ®7T 2 (X) = TTl(X) <g> 1 + I<g>7T 2 (X) 

/or aZZ leg. 

Proof. Using the product rule, 



7Tl <g> 7T 2 (X) (u ® u) = — 



ni (e tx )u®Tl2 {e tx )v 



t=o 



7Tl (X) U ® V + V ® 7T2 (X) It. 



This is what we wanted to show. 



□ 



Definition 5.27. If g is a Lie algebra, and w± and 7r 2 are representations of g 
acting on spaces V% and V 2 , i/ien t/ie tensor product of ~k\ and 7r 2 is a represen- 
tation of g acting on the space V\ (g> V" 2 defined by 

7Ti ®7T 2 (X) = 7Tl(X)<8> J + J®7T 2 (X) 

/or aZZ leg. 

It is easy to check that IIi ® n 2 and m ® 7r 2 are actually representations of G 
and g, respectively. There is some ambiguity in the notation, say, IT ® II 2 . For 
even if ITi and II 2 are both representations of the same group G, we coul d still 
regard IIi (g) II 2 as a representation of G x G, by taking H — G in definition ^>,22 . 
We will rely on context to make clear whether we are thinking of Hi eg) II 2 as a 
representation of G x G or as representation of G. 

Suppose IT and II 2 are irreducible representations of a group G. If we re- 
gard IT <£> n 2 as a representation of G, it may no longer be irreducible. If it is 
not irreducible, one can attempt to decompose it as a direct sum of irreducible 
representations. This process is called Clebsch-Gordan theory. In the case of 
SU(2), this theory is relatively simple. (In the physics literature, the problem of 
analyzing tensor products of representations of SU(2) is called "addition of angular 
momentum.") See Exercise [l5[ 

7. Schur's Lemma 

Let II and S be representations of a matrix Lie group G, acting on spaces V 
and W. Recall that a morphism of representations is a linear map (j> : V — > W 
with the property that 

4>(Il(A)v) = E{A) (0(„)) 

for all v G V and all A £ G. Schur's Lemma is an extremely important result which 
tells us about morphisms of irreducible representations. Part of Schur's Lemma 
applies to both real and complex representations, but part of it applies only to 
complex representations. 
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It is desirable to be able to state Schur's lemma simultaneously for groups and 
Lie algebras. In order to do so, we need to indulge in a common abuse of notation. 
If, say, II is a representation of G acting on a space V, we will refer to V as the 
representation, without explicit reference to II. 

Theorem 5.28 (Schur's Lemma). 1. Let V and W be irreducible real or 
complex representations of a group or Lie algebra, and let <fi : V —> W be a 
morphism. Then either <fi = or (j) is an isomorphism. 

2. Let V be an irreducible complex representation of a group or Lie algebra, 
and let cj> : V —> V be a morphism of V with itself. Then cf> — XI , for some 

XeC. 

3. Let V and W be irreducible complex representations of a group or Lie algebra, 
and let <fii, (f>2 : V —> W be non-zero morphisms. Then </>i = X(f>2, for some 

A e c. 

COROLLARY 5.29. Let II be an irreducible complex representation of a matrix 
Lie group G. If A is in the center of G, then 11(A) = XI . Similarly, if 7r is an 
irreducible complex representation of a Lie algebra g, and if X is in the center of g 
(i.e., [X,Y] = for all Y E g), then n(X) = XI. 

Proof. We prove the group case; the proof of the Lie algebra case is the same. 
If A is in the center of G, then for all B E G, 

U(A)U(B) = U(AB) = IL(BA) = U(B)U(A). 

But this says exactly that 11(A) is a morphism of II with itself. So by Point 2 of 
Schur's lemma, 11(A) is a multiple of the identity. □ 

Corollary 5.30. An irreducible complex representation of a commutative group 
or Lie algebra is one-dimensional. 

Proof. Again, we prove only the group case. If G is commutative, then the 
center of G is all of G, so by the previous corollary 11(A) is a multiple of the identity 
for each A 6 G. But this means that every subspace of V is invariant! Thus the 
only way that V can fail to have a non-trivial invariant subspace is for it not to have 
any non-trivial subspaces. This means that V must be one-dimensional. (Recall 
that we do not allow V to be zero-dimensional.) □ 

Proof. As usual, we will prove just the group case; the proof of the Lie algebra 
case requires only the obvious notational changes. 

Proof o/^[ Saying that is a morphism means <fi(H(A)v) = S(A) (</>(«)) for all 
v E V and all A E G. Now suppose that v E ker (</>). Then 

<j)(TL(A)v) = E(A)(j>(v) = 0. 

This shows that ker <fi is an invariant subspace of V. Since V is irreducible, we must 
have ker <fi = or ker <f> = V. Thus <f> is either one-to-one or zero. 

Suppose 4> is one-to-one. Then the image of ^> is a non-zero subspace of W . On 
the other hand, the image of <f> is invariant, for if w E W is of the form <f)(v) for 
some v E V, then 

E(A)w = E(A)0(u) = 4>{IL(A)v). 

Since W is irreducible and image(y) is non-zero and invariant, we must have 
image(I / ) = W. Thus (j) is either zero or one-to-one and onto. 
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Proof o/|^. Suppose now that V is an irreducible complex representation, and 
that <t> : V — » V is a morphism of V to itself. This means that (f)Tl(A) = H(A)<f> 
for all A G G, i.e., that </> commutes with all of the 11(A) 's. Now, since we are over 
an algebraically complete field, 4> must have at least one eigenvalue A £ C. Let U 
denote the eigenspace for <f> associated to the eigenvalue A, and let u € U . Then for 
each A e G 

<f>(Il(A)u) = n{A)c/)(v) = Xtt{A)u. 

Thus applying II (A) to an eigenvector of (f> with eigenvalue A yields another eigen- 
vector of (f> with eigenvalue A. That is, C/ is invariant. 

Since A is an eigenvalue, U 7^ 0, and so we must have U = V. But this means 
that 4>(v) = \v for all v € V, i.e., that <f> = XI. 

Proof o/[|. If 02 7^ 0, then by (1) 4>2 is an isomorphism. Now look at cf>i o 
As is easily checked, the composition of two morphisms is a morphism, so 4>\o (j)^ 1 
is a morphism of W with itself. Thus by (2), 0i o (/j^ 1 = A/, whence </>i = \4>2- d 

8. Group Versus Lie Algebra Representations 



We know from Chapter 3 (Theorem 3. IS) that every Lie group homomorphism 



gives rise to a Lie algebra homomorphism. In particular, this shows (Proposition 



5.4 ) that every representation of a matrix Lie group gives rise to a representation of 
the associated Lie algebra. The goal of this section is to investigate the reverse pro- 
cess. That is, given a representation of the Lie algebra, under what circumstances 
is there an associated representation of the Lie group? 



The climax of this section is Theorem 5.33, which states that if G is a con- 
nected and simply connected matrix Lie group with Lie algebra g, and if 7r is a 
representation of g, then there is a unique representation IT of G such that II and 



7r are related as in Proposition 5.4. Our proof of this theorem will make use of the 
Baker-Campbell-Hausdorff formula from Chapter 4. Before turning to this general 
theorem, we will examine two special cases, namely SO(3) and SU(2), for which we 
can work things out by hand. See Brocker and torn Dieck, Chapter II, Section 5. 



We have shown (Theorem 5.9) that every irreducible complex representation of 
su(2) is equivalent to one of the representations 7r m described in Section |[ (Recall 
that the irreducible complex representations of su(2) are in one-to-one correspon- 
dence with the irreducible representations of si (2; C).) Each of the representations 
7r m of su(2) was constructed from the corresponding representation n m of the group 
SU(2). Thus we see, by brute force computation, that every irreducible complex 
representation of su(2) actually comes from a representation of the group SU(2)! 
This is consistent with the fact that SU(2) is simply connected (Chapter 2, Prop. 



2.12|) 



Let us now consider the situation for SO (3). (Which is not simply connected.) 
We know from Exercise |l0| of Chapter 3 that the Lie algebras su(2) and so(3) are 
isomorphic. In particular, if we take the basis 

F _ 1 ( i 0\ _ x / 1\ _ x ( i 



2 \ -i ) z 2 V -1 ) 6 2 V i 
for su(2) and the basis 


Fi = I -1 I F 2 = I ii (i j F 3 
1 
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then direct computation shows that [Ei,E 2 ] = E 3 , [E 2 ,E 3 ] = Ei, [Es,Ei] = E 2 , 
and similarly with the E's replaced by the F's. Thus the map (j> : so (3) — > su(2) 
which takes -F, to Ei will be a Lie algebra isomorphism. 

Since su(2) and so(3) are isomorphic Lie algebras, they must have "the same" 
representations. Specifically, if n is a representation of su(2), then n o will be 
a representation of so(3), and every representation of so(3) is of this form. In 
particular, the irreducible representations of so(3) are precisely of the form a rn — 
7r r „ o cj). We wish to determine, for a particular m, whether there is a representation 



of the group S0(3) such that o~ m and S TO are related as in Proposition 5.4 



Proposition 5.31. Leta rn — ir m o(j) be the irreducible complex representations 
of the Lie algebra so(3) (m > 0). If m is even, then there is a representation E m 



of the group SO(3) such that a rn and E m are related as in Proposition 5.4- If m is 
odd, then there is no such representation o/SO(3). 

Note that the condition that m be even is equivalent to the condition that 
dim V m = m + 1 be odd. Thus it is the odd-dimensional representations of the Lie 
algebra so(3) which come from group representations. 

In the physics literature, the representations of su(2)/so(3) are labeled by the 
parameter I = m/2. In terms of this notation, a representation of so(3) comes from 
a representation of SO (3) if and only if I is an integer. The representations with I 
an integer are called "integer spin" ; the others are called "half-integer spin." 

8.0.1. Proof. 

Proof. Case 1: m odd. In this case, we want to prove that there is no rep- 



resentation E m such that o~ m and S TO are related as in Proposition 5.4. (We have 
already considered the case m = 1 in Exercise f7|) Suppose, to the contrary, that 
there is such a S m . Then Proposition ^4| says that 

for all X 6 so(3). In particular, take X = 2ixF\. Then, computing as in Chapter 
3, Section || we see that 

/ 1 \ 

e 2^F x I q CQs2jT _ gin J _ J 

V sin27r cos27T J 
Thus on the one hand E m (e 2,rFl ) = S m (7) = I, while on the other hand E m (e 2wFl ) : 

e 27TCT m (J?l)_ 

Let us compute e 27I ' cr ™( F i). By definition, a m (Fi) = •K m {(j}{Fx}) = n m (Ei). But, 
Ei = \H, where as usual 

' 1 
-1 



H = 



We know that there is a basis uq, u\, ■ ■ ■ , u m for V m such that Uk is an eigenvector 
for Tt rn (H) with eigenvalue m — 2k. This means that Uk is also an eigenvector for 
0m (-Pi) = %iTm(H), with eigenvalue \ ijn — 2k). Thus in the basis {uk} we have 

( l m \ 

|(m-2) 



o- m {Fi) 



\ i(-m) / 
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But we are assuming the m is odd! This means that m — 2k is an odd integer. 
Thus e 2T%(m-2k) _ anc j j n tne b as i s 



2ir<7 m (F 1 ) _ 



/ 6 



V 



27r 4 m 



27r|(m-2) 



ef" 2 



e 27r|(-m) y 

Thus on the one hand, S m (e 27rFl ) = E m (7) = I, while on the other hand S m (e 27rFl ) 
e 2^(T m (Fi) _ _j This is a contradiction, so there can be no such group representa- 
tion S m . 

Case 2: m is even. We will use the following: 

Lemma 5.32. There exists a Lie group homomorphism $ : SU(2) — > SO(3) 
such that 

1) $ maps SU(2) onto SO(3), 

2) ker$ = {1,-1}, and 

3) the associated Lie algebra homomorphism $ : su(2) — > so(3) is an isomor- 
phism which takes Ei to Fi. That is, $ = 

Proof. Exercise [l?]. □ 

Now consider the representations H m of SU(2). I claim that if m is even, then 
II m (— J) = I. To see this, note that 

ni 
— TTl 



e 2 ^ = cxp 



= -I. 



Thus H m (-7) = U m {e 2vEl ) = e ^(2^i). But as in Case 1, 



3 7T m (27rB 1 ) 



2srf(ra-2) 



V 



e 27ri(-ro) y 



Only, this time, m is even, and so \irn — 2k) is an integer, so that II m (— 7) = 

e 7r m (27r,Ei) _ j 

Since n m (-7) = 7, II m (-L7") = n m (C7) for all U G SU(2). According to 
Lemma HH , for each R e SO(3), there is a unique pair of elements {U,—U} such 
that $(J7) = ®(—U) = i?. Since II m ({7) = II TO (— {/), it makes sense to define 

S m (7?) = U m (U). 

It is easy to see that S m is a Lie group homomorphism (hence, a representation). 
By construction, we have 

(5.12) n m = S m o $. 

Now, if S m denotes the Lie algebra representation associated to S TO , then it 
follows from ( |5.12 ) that 

7T m = E TO o $. 

But the Lie algebra homomorphism $ takes Ei to F», that is, $ = So 7r TO = 

S m o (h" 1 , or S m = 7r m o (A. Thus S m = cr m , which is what we want to show. □ 
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It is now time to state the main theorem. 

Theorem 5.33. 1. Let G,H be a matrix Lie groups, let 4>\,(j>2 : G — > H 
be Lie group homomorphisms, and let (pi, 4>2 ■ 9 — ► f) be the associated Lie 
algebra homomorphisms. If G is connected and (f>i — 4>2, then = 02. 

2. Let G, H be a matrix Lie groups with Lie algebras g and f). Let : g — > rj be 
a Lie algebra homomorphism. If G is connected and simply connected, then 
there exists a unique Lie group homomorphism <p : G — ► H such that <p and 



are related as in Theorem 3. It of Chapter 3. 



This has the following corollaries. 

Corollary 5.34. Suppose G and H are connected, simply connected matrix 
Lie groups with Lie algebras g and f). If g = f) then G = H. 



Proof. Let <f> : g — > () be a Lie algebra isomorphism. By Theorem 5.33, there 
exists an associated Lie group homomorphism <fi : G — » H . Since : f) — > q is also 
a Lie algebra homomorphism, there is a corresponding Lie group homomorphism 
ip : H — > G. We want to show that and t/j are inverses of each other. 

Well, 4>o tp = <fi o ijj = 1^ , so by the Point 1 of the Theorem, <f> o ip = Ijj. 
Similarly, ip o <f> — Iq. □ 

Corollary 5.35. 1. Let G be a connected matrix Lie group, let III and 
LT2 be representations of G, and let ~k\ and 7T2 be the associated Lie algebra 
representations. If tt± and it 2 are equivalent, thenlii and LT2 are equivalent. 

2. Let G be connected and simply connected. If 'tt is a representation of g, then 
there exists a representation LT of G, ac ting on the same space, such that LT 



and tt are related as in Proposition 5.4 



Proof. For (1), let I±i act on V and II2 on W. We assume that the associated 
Lie algebra representations are equivalent, i.e., that there exists an invertible linear 
map <f> :V W such that 

<t>{-K l {X)v) = n 2 {X)<l>{v) 

for all X £ g and all v G V. This is the same as saying that <jmi(X) = ir2(X)4>, or 
equivalently that cj)TTi(X)(j)~ 1 = ^(X) (for all X € g). 
Now define a map S 2 : G — >• GL(VK) by the formula 

Ea(A) - 0ni(A)0- x . 

It is trivial to check that £2 is a homomorphism. Furthermore, differentiation shows 
that the associated Lie algebra homomorphism is 

a 2 {X) = ^(X)^ 1 =tt 2 (X) 

for all X. Then by (1) in the Theorem, we must also have £2 = LT2, i.e., 

cpiiiiA)^- 1 =n 2 (A) 

for all A 6 G. But this shows that ill and n2 are equivalent. 

Point (2) of the Corollary follows immediately from Point (2) of the Theorem, 
by taking H = QL{V). □ □ 



We now proceed with the proof of Theorem 5.33 



92 



5. BASIC REPRESENTATION THEORY 



PROOF. Step 1: Verify Point (1) of the Theorem. 



Since G is connected, Corollary 3.26 of Chapter 3 tells us that every element 



A of G is a finite product of the form A — exp X expX ■ ■ ■ expX n , with Xj E g. 
But then if <f>\ = 4>2i we have 

0x (e Xl • • • e x ") = e^ Xl > • • • e^*"* = e^ Xl > • • • e* 2 <*"> = <fe (e Xl • • • e x ") . 
So we now need only prove Point (2). 

Step 2: Define <j) in a neighborhood of the identity. 



Proposition 3.23 of Chapter 3 says that the exponential mapping for G has a 
local inverse which maps a neighborhood V of the identity into the Lie algebra g. 
On this neighborhood V we can define <j) : V — > H by 

cp{A) = exp{#ogA)}. 

That is 

<f> — exp o(j) o log . 

(Note that if there is to be a homomorphism 4> as in Theorem 3.18| of Chapter 3, 
then on V, <f> must be exp o0 o log.) 



It follows from Corollary 4.4 to the Baker-Campbell-Hausdorff formula that this 
is a "local homomorphism." That is, if A and B are in V, and if AB happens 
to be in V as well, then <j)(AB) = tp(A)<j)(B). (See the discussion at the beginning 
of Chapter 4.) 

Step 3: Define <f> along a path. 

Recall that when we say G is connected, we really mean that G is path- 
connected. Thus for any A E G, there exists a path A(t) E G with A(0) = I 
and A(l) = A. A compactness argument shows that there exists numbers = to < 
fi < t-2 ■ ■ ■ < t„ = 1 such that 

(5.13) A(s)A(t i )- 1 E V 

for all s between t t and . 

In particular, for i = 0, we have A(s) G V for < s < t\. Thus we can 



define <j)(A(s)) by Step 2 for s E [0, fi]. Now, for s E [ti,t 2 ] we have by (5.13) 
A(s)A(ti)^ 1 E V. Moving the A(ti) to the other side, this means that for s E [ti, tz] 
we can write 

A(s)= [A^Aih)- 1 ] A(h). 

with A(s)A(ti)^ 1 E V . If 4> is to be a homomorphism, we must have 

(5.14) $ (A(s)) = <t> {[A^Aih)- 1 ] A(h)) = (A^Aih)- 1 ) (A(ti)) . 

But <fi (A(ti)) has already been defined, and we can define (j> (^A(s)A(t 1 )~ 1 ^j by Step 
2. In this way we can use (PH) to define 0(A(s)) for s G [ti,t 2 ]. 



Proceeding on in the same way, we can define cj>(A(s)) successively on each 
interval [ti, U+i] until eventually we have defined (f> (A(s)) on the whole time interval 
[0, 1]. This in particular serves to define (^4(1)) = 4>(A). 



Step 4- Prove independence of path. 
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In Step 3, we "denned" <j)(A) by denning 4> along a path joining the identity to 
A. For this to make sense as a definition of 4>(A) we have to prove that the answer 
is independent of the choice of path, and also, for a particular path, independent 
of the choice of partition (to, tx, ■ ■ ■ t n ). 

To establish independence of partition, we first show that passing from a par- 
ticular partition to a refinement of that partition doesn't change the answer. (A 
refinement of a partition is one which contains all the points of the original partition, 
plus some other ones.) This is proved by means of the Baker-Campbcll-Hausdorff 
formula. For example, suppose we insert an extra partition point s between to and 
tx. Under the old partition we have 

(5.15) 4>(A(tx))= expose log (A(tx)). 
Under the new partition we write 

A(tx) = [A^A^y 1 ] A(s) 

so that 

(5.16) <f>(A{tx)) = expose log (A(t l )A(sy 1 ) expose \og(A(s)) . 

But (as noted in Step 2), Corollary [O] of the Baker-Campbell-Hausdorff for- 
mula (Chapter 4, Section ||) implies that for A and B sufficiently near the identity 

exp o0 o log(AB) = exp o<f> o \og(A) exp o</> o log(£?) 



Thus the right sides of ( 5.15 ) and ( 5.16| ) are equal. Once we know that passing to a 



refinement doesn't change the answer, we have independence of partition. For any 
two partitions of [0, 1] have a common refinement, namely, the union of the two. 

Once we know independence of partition, we need to prove independence of 
path. It is at this point that we use the fact that G is simply connected. In 
particular, because of simple connectedness, any two paths Ax(t) and A 2 (t) joining 
the identity to A will be homotopic with endpoints fixed. (This is a standard 
topological fact.) Using this, we want to prove that Step 3 gives the same answer 
for Ai and A 2 . 

Our strategy is to deform Ax into A 2 in a series of steps, where during each 
step we only change the path in a small time interval (t, t + e), keeping everything 
fixed on [0, t] and on [t + e, 1] . Since we have independence of partition, we can take 
t and t + e to be partition points. Since the time interval is small, we can assume 
there are no partition points between t and t + e. Then we have 

(A(t + e)) = (A(t + e)A{ty 1 ) (A(i)) 

where <j> (A(t + e)A(ty 1 ) is defined as in Step 2. 

But notice that our value for (f> (A(t + e)) depends only on A (t) and A (t + e), 
not on how we get from A (t) to A(t + e)\ Thus the value </> (A(t + e)) doesn't 
change as we deform the path. But if <fi (A(t + e)) doesn't change as we deform the 
path, neither does (p (A(l)), since the path isn't changing on [t + e, 1]. 

Since Ax and A 2 are homotopic with endpoints fixed, it is possible (by a stan- 
dard topological argument) to deform A\ into A 2 in a series of small steps as above. 

Step 5: Prove that (j) is a homomorphism, and is properly related to 4>- 
Now that we have independence of path (and partition), we can give a simpler 
description of how to compute (f>. Given any group element A, A can be written in 
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the form 

A = C n C n -i ■ ■ ■ C\ 

with each Ci in V. (This follows from the (path-)connectedness of G.) We can then 
choose a path A(t) which starts at the identity, then goes to Ci, then to C2C1, and so 
on to C„C„_i • • • C\ = A. We can choose a partition so that A{ti) — CiCi-x ■ ■ ■ C\. 
By the way we have defined things 

4,{A) = cj, (A(l)A(t^x)- 1 ) 4, {A{t n ^)A^n-iY x ) ■■■4> (A(ti)A(O)) . 

But 

A{t t )A{U^)- 1 = {dd^ ■ ■ ■ Ci) (Ci-! • • • d)" 1 = d 

so 

<t>{A) =0(C n )^»(C„_ 1 )---0(C 1 ). 

Now suppose that A and -B are two elements of G and we wish to compute 
4>{AB). Well, write 

A = C n C n _x ■ ■ ■ G\ 
B = D n D n - 1 ---D 1 . 

Then 

(AB) = (C n C n -i ■ ■ ■ CxD n D n -x ■ ■ ■ Dx) 
= [4>{C n )--^{Cx)] [4>{D n )---4>{D 1 )] 
= 4>(A)cb(B). 

We see then that tj> is a homomorphism. It remains only to verify that (j> 
has the proper relationship to <p. But since <p is defined near the identity to be 
<j) — expocj) o log, we see that 

d_ 

dt 



t<p(x) _ 



t=0 



Thus <p is the Lie algebra homomorphism associated to the Lie group homomor- 
phism (p. 



This completes the proof of Theorem 5.33. □ 



9. Covering Groups 

It is at this point that we pay the price for our decision to consider only matrix 
Lie groups. For the universal covering group of a matrix Lie group (defined below) 
is always a Lie group, but not always a matrix Lie group. For example, the universal 
covering group of SL (n; R) (n > 2) is a Lie group, but not a matrix Lie group. (See 
Exercise PQ .) 

The notion of a universal cover allows us to determine, in the case of a non- 
simply connected group, which representations of the Lie algebra correspond to 



representations of the group. See Theorem 5.41 below 



Definition 5.36. Let G be a connected matrix Lie group. A universal cov- 
ering group of G (or just universal cover) is a connected, simply connected 
Lie group G, together with a Lie group homomorphism <fi : G — > G ( called the 
projection map) with the following properties: 
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1. <j) maps G onto G. 

2. There is a neighborhood U of I in G which maps homeomorphically under 
4> onto a neighborhood V of I in G. 

Proposition 5.37. If G is any connected matrix Lie group, then a universal 
covering group G of G exists and is unique up to canonical isomorphism. 

We will not prove this theorem, but the idea of proof is as follows. We assume 
that G is a matrix Lie group, hence a Lie group (that is, a manifold). As a mani- 
fold, G has a topological universal cover G which is a connected, simply connected 
manifold. The universal cover comes with a "projection map" <j) : G — > G which is 
a local homeomorphism. Now, since G is not only a manifold but also a group, G 
also becomes a group, and the projection map <f> becomes a homomorphism. 

Proposition 5.38. Let G be a connected matrix Lie group, G its universal 
cover, and <f> the projection map from G to G. Suppose that G is a matrix Lie 
group with Lie algebra g. Then the associated Lie algebra map 



9^9 



is an isomorphism. 



In light of this Proposition, we often say that G and G have the same Lie 
algebra. 

The above Proposition is true even if G is not a matrix Lie group. But to make 
sense out of the Proposition in that case, we need the definition of the Lie algebra 
of a general Lie group, which we have not defined. 

Proof. Exercise |l8|. □ 

9.1. Examples. The universal cover of S* 1 is R, and the projection map is the 
map x — > e lx . The universal cover of S0(3 ) is SU(2), and the projection map is the 
homomorphism described in Lemma 5.32| . 



More generally, we can consider SO(n) for n > 3. As it turns out, for n > 3 the 
universal cover of SO(n) is a double cover. (That is, the projection map cj> is two-to- 
one.) The universal cover of SO(n) is called Spin(rt), and may be constructed as a 
certain group of invertible elements in the Clifford algebra over R™. See Brockcr 
and torn Dieck, Chapter I, Section 6, especially Propositions 1.6.17 and 1.6.19. In 
particular, Spin(n) is a matrix Lie group. 

The case n = 4 is quite special. It turns out that the universal cover of S0(4) 
(i.e., Spin(4)) is isomorphic to SU(2) x SU(2). This is best seen by regarding R 4 as 
the quaternion algebra. 

Theorem 5.39. Let G be a matrix Lie group, and suppose that G is also a 
matrix Lie group. Identify the Lie algebra of G with the Lie algebra g of G as in 



Proposition 5. St. Suppose that H is a matrix Lie group with Lie algebra f), and that 



Q — > f) is a homomorphism. Then there exists a unique Lie group homomorphism 



G — ► H such that 4> and 4> are related as in Theorem 3. IS of Chapter 3. 



Proof. G is simply connected. □ 
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COROLLARY 5.40. Let G and G be as in Theorem 5.31, and let n be a repre 



sentation of g. Then there exists a unique representation H of G such that 
for all X E g. 



Theorem 5.41. Let G and G be as in Theorem 5.3i, and let <\> : G — > G . Now 



let tt be a representation of g. and II the associated representation of G, as in the 
Corollary. Then there exists a representation UofG corresponding to tt if and only 
*f 

kerll D ker</>. 

Proof. Exercise [l9|. □ 

10. Exercises 

1. Let G be a matrix Lie group, and g its Lie algebra. Let 111 and LI2 be 
representations of G, and let tt\ and 7r 2 be the associated representations of 



g (Proposition 5.4). Show that if ill and II2 are equivalent representations 
of G, then tti and tt 2 are equivalent representations of g. Show that if G is 
connected, and if tti and 7r 2 are equivalent representations of g, then IL and 
n 2 are equivalent representations of G. 



Hint: Use Corollary 3.26 of Chapter 3. 

2. Let G be a connected matrix Lie group with Lie algebra g. Let II be a 
representation of G acting on a space V, and let tt be the associated Lie 
algebra representation. Show that a subspace W C V is invariant for II if 
and only if it is invariant for tt. Show that II is irreducible if and only if tt 
is irreducible. 

3. Suppose that II is a finite-dimensional unitary representation of a matrix 
Lie group G. (That is, V is a finite-dimensional Hilbert space, and II is 
a continuous homomorphism of G into U(V).) Let tt be the associated 
representation of the Lie algebra g. Show that for each X £ g, tt(X)* = 

-AX). 

4. Show explicitly that the adjoint representation and the standard represen- 
tation are equivalent representations of the Lie algebra so(3). Show that the 
adjoint and standard representations of the group SO(3) are equivalent. 

5. Consider the elements E\, E2, and E3 in su(2) defined in Exercise || of 
Chapter 3. These elements form a basis for the real vector space su(2). 
Show directly that E\, E%, and E3 form a basis for the complex vector space 
sl(2;C). 

6. Define a vector space with basis uq, u\ ■ ■ ■ u m . Now define operators tt(H), 



tt(X), and tt(Y) by formula (5.1C). Verify by direct computation that the op- 
erators defined by fl5.10| ) satisfy the commutation relations [tt(H),tt(X)] = 
2tt{X), [ir(H),7r(Y)] = -2tx(Y), and [tt{X), ir(Y)] = tt(H). (Thus tt(H), 
tt{X), and tt(Y) define a representation of sl(2;C).) Show that this repre- 
sentation is irreducible. 

Hint: It suffices to show, for example, that [tt(H),tt(X)] — 2tt(X) on 
each basis element. When dealing with tt(Y), don't forget to treat separately 
the case of Uk, k < m, and the case of u m - 
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7. We can define a two-dimensional representation of so(3) as follows: 

lfi 
2 V -i 



1/0 1 
2 I -1 



1/0 i 

~ 2 \ i 

(You may assume that this actually gives a representation.) Show that there 
is no group representation II of SO (3) such that II and tt are related as in 


















1 





-1 











1 











-1 











1 





-1 


















Proposition 5.4 



Hint: If X E so(3) is such that e x = I, and II is any representation of 
SO(3), then Tl(e x ) = II (J) = I. 

Remark: In the physics literature, this non-representation of SO(3) is 
called "spin \P 

8. Consider the standard representation of the Heisenberg group, acting on C 3 . 
Determine all subspaces of C 3 which are invariant under the action of the 
Heisenberg group. Is this representation completely reducible? 

9. Give an example of a representation of the commutative group R which is 
not completely reducible. 

10. Consider the unitary representations 11^ of the real Heisenberg group. As- 
sume that there is some sort of associated representation of the Lie alge- 
bra, which should be given by 

'' n» (e tx ) f 

t=o 

(We have not proved any theorem of this sort for infinite-dimensional unitary 
representations.) 

Computing in a purely formal manner (that is, ignoring all technical 
issues) compute 

/ 1 \ / \ / 1 

7T ft ; 7T fi 1 ; 7TR 
\ / \ J \000 

Verify (still formally) that these operators have the right commutation rela- 
tions to generate a representation of the Lie algebra of the real Heisenberg 
group. (That is, verify that on this basis, tth[X, Y] = [%f i (X),TTfi(Y)].) 
Why is this computation not rigorous? 

1 1 . Consider the Heisenberg group over the field Z p of integers mod p, with p 
prime, namely 

1 a b \ 

1 c I |a, b, c £ 1 V 
1/ 

This is a subgroup of the group GL (3; Z p ), and has p 3 elements. 
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Let Vp denote the space of complex- valued functions on Z p , which is 
a p-dimensional complex vector space. For each non-zero n £ Z p , define a 
representation of H p by the formula 

(n„/) (x) = e -^b/p^ncx/pj ^ _ a j xe z ^ 

(These representations are analogous to the unitary representations of the 
real Heisenberg group, with the quantity 2nn/p playing the role of h.) 

a) Show that for each n, II„ is actually a representation of H p , and that 
it is irreducible. 

b) Determine (up to equivalence) all the one-dimensional representations 
of H p . 

c) Show that every irreducible representation of H p is either one-dimensional 
or equivalent to one of the II n 's. 

12. Prove Theorem 5.1£. 

Hints: For existence, choose bases {e^} and {fj} for U and V . Then 
define a space W which has as a basis {wij |0<i<n, 0<j<m}. Define 
4>{&i,fj) — and extend by bilinearity. For uniqueness, use the universal 
property. 

13. Let g and f) be Lie algebras, and consider the vector space g © f). Show that 
the following operation makes g © f) into a Lie algebra 

[(X 1 ,Y 1 ),(X 2 ,Y 2 )] = ([X 1 ,X 2 ],[Y 1 ,Y 2 ]) . 

Now let G and H be matrix Lie groups, with Lie algebras g and f). Show 
that G x H can be regarded as a matrix Lie group in an obvious way, and 
that the Lie algebra of G x H is isomorphic to g © f). 

14. Suppose that n is a representation of a Lie algebra g acting on a finite- 
dimensional vector space V. Let V* denote as usual the dual space of V, 
that is, the space of linear functionals on V. If A is a linear operator on V, 
let A tr denote the dual or transpose operator on V* , 

(A tr cf) (v) = <f>(Av) 

for (j> £ V* , v € V. Define a representation tt* of g on V* by the formula 

7T* (X) = — 7T (X tr ) . 

a) Show that tt* is really a representation of g. 

b) Show that (tt*)* is isomorphic to tt. 

c) Show that 7r* is irreducible if and only if tt is. 

d) What is the analogous construction of the dual representation for 
representations of groups? 

15. Recall the spaces V m introduced in Section [| viewed as representations of 
the Lie algebra sl(2;C). In particular, consider the space V\ (which has 
dimension 2). 

a) Regard V\ ®V\ as a representation of sl(2;C), as in Definition |5.27 . 
Show that this representation is not irreducible. 

b) No w vie w V\ ® V\ as a representation of sl(2;C) © sl(2;C), as in 
Definition 5.24. Show that this representation is irreducible. 

c) More generally, show that V m £g) V n is irreducible as a representation 
of sl(2;C) © sl(2;C), but reducible (except if one of n or m is zero) as a 
representation of sl(2; C). 
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16. Show explicitly that exp : so(3) — > SO (3) is onto. 

Hint: Using the fact that SO(3) C SU(3), show that the eigenvalues of 
R G SO(3) must be of one of the three following forms: (1, 1, 1), (1, —1, —1), 
or (1, e l9 , e~ l8 ). In particular, R must have an eigenvalue equal to one. Now 
show that in a suitable orthonormal basis, R is of the form 

/ 1 
R= cos6> sinfl 
\ — sin 8 cos 9 

17. Proof of Lemma \5.3^ . 

Let {Ei, E 2 , E 3 } be the usual basis for su(2), and {Fi, F 2 , F 3 } be the 
basis for so(3) introduced in Section |[ Identify su(2) with R 3 by identifying 
the basis {E\, E 2 ,E 3 } with the standard basis for R 3 . Consider acLEi, &dE 2 , 
and adi?3 as operators on su(2), hence on R 3 . Show that ad£i = Fi, for 
i = 1,2, 3. In particular, ad is a Lie algebra isomorphism of su(2) onto so(3). 

Now consider Ad : SU(2) -> GL(SU(2)) = GL(3;R). Show that the 
image of Ad is precisely SO(3). Show that the kernel of Ad is {/, — /}. 

Show that Ad : SU(2) — > SO(3) is the homomorphism <!> required by 



Lemma 5.32 



18. Proof of Proposition 5.3< 



Suppose that G and G are matrix Lie groups. Suppose that (j> : G — > G 
is a Lie group homomorphism such that (f> maps some neighborhood U oi I 
in G homeomorphically onto a neighborhood V of I in G. Prove that the 
associated Lie algebra map (f> : q — > g is an isomorphism. 

Hints: Suppose that <p were not one-to-one. Show, then, that there 
exists a sequence of points A n in G with A n ^ I, A n — * / and 4>(A n ) = I, 
giving a contradiction. 



To show that <fi is onto, use Step 1 of the proof of Theorem 5.32 to show 
that on a sufficiently small neighborhood of zero in g, 

<p = log o0 o exp . 

Use this to show that the image of <fi contains a neighborhood of zero in g. 
Now use linearity to show that the image of 4> is all of g. 
19. Proof of Theorem 5.4\ - 



First suppose that kerLT Z> ker^. Then construct II as in the proof of 



Proposition 5.31 



Now suppose that there is a representation II of G for which the as- 
sociated Lie algebra representation is it. We want to show, then, that 
kerll D ker0. Well, define a new representation E of G by 

£ = no cj). 

Show that the associated Lie algebra homomorphism a is equal to 7r, so that, 



by Point (1) of Theorem 5.33 , n = S. What can you say about the kernel 
of S? 

20. Fix an integer n > 2. 

a) Show that every (finite-dimensional complex) representation of the 
Lie algebra sl(n;R) gives rise to a representation of the group SL(n;K), 
even though SL (n;R) is not simply connected. (You may use the fact that 
SL (n; C) is simply connected.) 
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b) Show that the universal cover of SL(n;R) is not isomorphic to any 
matrix Lie group. (You may use the fact that SL(n;R) is not simply con- 
nected.) 

21. Let G be a matrix Lie group with Lie algebra g, let f) be a subalgebra of g, 
and let H be the unique connected Lie subgroup of G with Lie algebra t). 
Suppose that there exists a compact simply connected matrix Lie group K 
such that the Lie algebra of K is isomorphic to h. Show that H is closed. 
Is H necessarily isomorphic to K1 



CHAPTER 6 



The Representations of SU(3), and Beyond 



1. Preliminaries 

There is a theory of the representations of semisimple groups/Lie algebras which 
includes as a special case the representation theory of SU(3). However, I feel that 
it is worthwhile to examine the case of SU(3) separately. I feel this way partly 
because SU(3) is an important group in physics, but chiefly because the general 
semisimple theory is difficult to digest. Considering a non-trivial example makes it 
much clearer what is going on. In fact, all of the elements of the general theory are 
present already in the case of SU(3), so we do not lose too much by considering at 
first just this case. 

The main result of this chapter is Theorem [j], which states that an irreducible 
finite-dimensional representation of SU(3) can be classified in terms of its "high- 
est weight." This is analogous to labeling the irreducible representations V m of 
SU(2)/sl(2; C) by the highest eigenvalue of ir m (H). (The highest eigenvalue of 
)r m (if) in V m is precisely m.) We will then discuss, without proofs, what the 
corresponding results are for general semisimple Lie algebras. 

The group SU(3) is connected and simply connected (Brocker and torn Dieck), 
so by Corollary |l| of Chapter 5, the finite-dimensional representations of SU(3) 
are in one-to-one correspondence with the finite-dimensional representations of the 
Lie algebra su(3). Meanwhile, the complex representations of su(3) are in one-to- 
one correspondence with the complex-linear representations of the complexified Lie 
algebra su(3)c- But su(3)c — si (3;C), as is easily verified. Moreover, since SU(3) 
is connected, it follows that a subspace W C V is invariant under the action of 
SU(3) if and only if it is invariant under the action of si (3;C). Thus we have the 
following: 

Proposition 6.1. There is a one-to-one correspondence between the finite- 
dimensional complex representations n o/SU(3) and the finite- dimensional complex- 
linear representations tt o/sl (3; C). This correspondence is determined by the prop- 
erty that 

U(e x ) =e"W 

for allX G su(3) C sl(3;C). 

The representation Yi is irreducible if and only the representation n is irre- 
ducible. Moreover, a subspace W C V is invariant for II if and only if it is invariant 
for it. 



Since SU(3) is compact, Proposition 5.17 of Chapter 5 tells us that all the 
finite-dimensional representations of SU(3) are completely reducible. The above 
proposition then implies that all the hnite-dimensional representations of si (3; C) 
are completely reducible. 
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Moreover, we can apply the same reasoning to the group SU(2), its Lie algebra 
su(2), and its complexified Lie algebra sl(2;C). Since SU(2) is simply connected, 
there is a one-to-one correspondence between the complex representations of SU(2) 
and the representations of the complexified Lie algebra si (2; C). Since SU(2) is com- 
pact, all of the representations of SU(2)-and therefore also of si (2; C)-are completely 
reducible. Thus we have established the following. 

Proposition 6.2. Every finite-dimensional (complex-linear) representation of 
sl(2;C) or si (3; C) is completely reducible. In particular, every finite-dimensional 
representation o/sl(2;C) or si (3; C) decomposes as a direct sum of irreducible in- 
variant subspaces. 

We will use the following basis for si (3; C): 



#i=0-10 Ho = 




X x = X 2 = 1 X. 





Fi = 1 Y 2 = Y 3 



Note that the span of {Hi, X\, Yi) is a subalgebra of si (3; C) which is isomor- 
phic to si (2; C), by ignoring the third row and the third column. Similarly, the span 
of {H 2 , X 2 , Y 2 } is a subalgebra isomorphic to sl(2; C), by ignoring the first row and 
first column. Thus we have the following commutation relations 



[Hi,X{\ = 2Xi [H 2 ,X 2 ] = 2X 2 
[Hi, Yi] = -2Yi [H 2 ,Y 2 ] = -2Y 2 
[Xi,Yi] = Hi [X 2 ,Y 2 ] = H 2 . 

We now list all of the commutation relations among the basis elements which 
involve at least one of Hi and H 2 . (This includes some repetitions of the commu- 
tation relations above.) 



(6.1) 



[Hi,H 2 ] = 









[Hi,Xi] = 
[H 2 ,Xi] = 


2Xi 
-Xi 


[Hi,Yi] = 
[H 2 ,Yi] = 


-2Yi 
Yi 


[Hi,X 2 ] = 
[H 2 ,X 2 \ = 


-x 2 

2X 2 


[Hi,Y 2 ] = 
[H 2 ,Y 2 ] = 


Y 2 
-2Y 2 


[Hi,X 3 ] = 
[H 2 ,X 3 ] = 


x 3 
x 3 


[Hi,Y 3 ] = 
[H 2 ,Y 3 ] = 


-Y 3 
-Y 3 



2. WEIGHTS AND ROOTS 
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We now list all of the remaining commutation relations. 



[Xi,Y 2 ] 
[X 3 ,Y 3 ] 


H 1 
Hi 

= m+H: 






[X U X 2 ] 
[Xi,Y 2 ] 


= x 3 




[YuY 2 ] = 
[X 2 ,Y X ] = 


-Y 3 



[Xi,X 3 ] 

[x 2 ,x 3 \ 






[Yi,Y 3 ] = 
[Y 2 ,Y 3 ] = 






[X 2 ,Y 3 ] 
[X!,Y 3 ] 


= Y 1 
= -Y 2 


[X 3 ,Y 2 ] = 
[Xa,Yi] = 


Xi 

-x 2 



Note that there is a kind of symmetry between the Xi's and the Y^s. If a 
relation in the first column involves an Xi and/or a Yj, the corresponding relation 
in the second column will involve a Yi and/or an Xj. (E.g., we have the relation 
[f/i,X2] = — X 2 in the hrst column, and the relation [H 2 ,Y 2 ] = Y 2 in the second 
column.) See Exercise 0. 

All of the analysis we will do for the representations of si (3; C) will be in terms 
of the above basis. From now on, all representations of si (3; C) will be assumed to 
be finite-dimensional and complex-linear. 

2. Weights and Roots 

Our basic strategy in classifying the representations of si (3; C) is to simultane- 
ously diagonalize tt(Hi) and it(H 2 ). Since H\ and H 2 commute, 7r(i?i) and n(H 2 ) 
will also commute, and so there is at least a chance that ir(Hi) and ir(H 2 ) can be 
simultaneously diagonalized. 

Definition 6.3. If(n,V) is a representation o/sl(3;C), then an ordered pair 
fi = (/ii, [i 2 ) G C 2 is called a weight for ir if there exists v ^ in V such that 

■n{Hi)v = fiiv 

(6.2) tt(H 2 )v = h 2 v. 

The vector v is called a weight vector corresponding to the weight /i. // /i = 



(/ii, fi 2 ) is a weight, then the space of all vectors v satisfying (6.i) is the weight 
space corresponding to the weight \x. 

Thus a weight is simply a pair of simultaneous eigenvalues for 7r(7Ji) and Tt(H 2 ). 

Proposition 6.4. Every representation o/sl(3;C) has at least one weight. 

Proof. Since we are working over the complex numbers, 7r(-Hi) has at least 
one eigenvalue Let W C V be the eigenspace for tt(Hi) with eigenvalue n\. I 
assert that W is invariant under ir(H 2 ). To see this consider w € W, and compute 

7r(.ffi) (tt(H 2 )w) = n(H 2 )w(Hi)w 

= tt(H 2 ) (fiiw) — fJ,in(H 2 )w. 

This shows that t:(H 2 )w is either zero or an eigenvector for tt(Hi) with eigenvalue 
fi\\ thus W is invariant. 
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Thus tt(H 2 ) can be viewed as an operator on W . Again, since we are over C, 
the restriction of tt(H 2 ) to W must have at least one eigenvector w with eigenvalue 
/i 2 . But then w is a simultaneous eigenvector for tt(H\) and ir(H%) with eigenvalues 
fii and fj,2. □ 

Now, every representation tt of si (3; C) can be viewed, by restriction, as a 
representation of the subalgebra {H X ,X X ,Y X } = sl(2;C). Note that, even if tt is 
irreducible as a representation of si (3;C), there is no reason to expect that it will 
still be irreducible as a representation of the subalgebra {H X ,X X ,Y{\. Neverthe- 
less, tt restricted to {H x , Xi,Y\} must be some finite-dimensional representation 
of sl(2;C). The same reasoning applies to the restriction of tt to the subalgebra 
{H 2 ,X 2 , Y 2 }, which is also isomorphic to si (2; C). 

Proposition 6.5. Let {tt, V) be any finite- dimensional complex-linear repre- 
sentation of sl(2;C) = {H,X, y}. Then all the eigenvalues of n(H) are integers. 



Proof. By Proposition |6.2| , V decomposes as a direct sum of irreducible in- 
variant subspaces Vi. Each V, must be one of the irreducible representations of 
sl(2;C), which we have classified. In particular, in each Vi, tt{H) can be diagonal- 
ized, and the eigenvalues of tt(H) are integers. Thus tt(H) can be diagonalized on 
the whole space V, and all of the eigenvalues are integers. □ 

COROLLARY 6.6. If tt is a representation of si (3;C), then all of the weights of 
tt are of the form 

fi = (mi, 7712) 

with mi and "i2 integers. 

Proof. Apply Proposition |6.5| to the restriction of tt to {Hi,Xi, Yi}, and to 
the restriction of tt to {H 2 , X 2 , Y 2 }. □ 

Our strategy now is to begin with one simultaneous eigenvector for tt(Hi) and 
tt(H 2 ), and then to apply 7r(Xj) or w(Yi), and see what the effect is. The following 



definition is relevant in this context. (See Lemma 6.8 below.) 

Definition 6.7. An ordered pair a = (a x , a 2 ) e C 2 is called a root if 

1. ct\ and a 2 are not both zero, and 

2. there exists Z £ si (3;C) such that 

[H 1 ,Z] = a x Z 
[H 2 , Z] = a 2 Z. 

The element Z is called a root vector corresponding to the root a. 

That is, a root is a non-zero weight for the adjoint representation. The com- 



mutation relations (|6.1| ) tell us what the roots for si (3; C) are. There are six roots. 

a Z 

(2,-1) Xi 

(-1,2) X 2 

(6.3) (1,1) X 3 

(-2,1) Y X 

(1,-2) Y 2 

(-1,-1) Y 3 



3. HIGHEST WEIGHTS AND THE CLASSIFICATION THEOREM 
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It is convenient to single out the two roots corresponding to X\ and X 2 and 
give them special names: 

a« = (2,-l) 

(6.4) a (2) = (-l,2). 

The roots a^ 1 ) and a 1 - 2 ) are called the simple roots. They have the property that 
all of the roots can be expressed as linear combinations of o^ 1 ) anc [ a ( 2 ) w itfi integer 
coefficients, and these coefficients are either all greater than or equal to zero or all 
less than or equal to zero. This is verified by direct computation: 

(2,-1) = a« 

(-1,2) = a< 2 > 

(1,1) = a^+a^ 

(-2,1) = -a« 

(1,-2) = - Q P) 

(-1,-1) = -a«-a( 2 ). 

The significance of the roots for the representation theory of si (3;C) is con- 
tained in the following Lemma. Although its proof is very easy, this Lemma plays 
a crucial role in the classification o f the representations of sl(3;C). Note that 
this Lemma is the analog of Lemma 5.10| of Chapter 5, which was the key to the 



classification of the representations of si (2; C). 

Lemma 6.8. Let a = (ai, a 2 ) be a root, and Z a 7^ a corresponding root vector 
in si (3; C). Let n be a representation of si (3; C), [i = (mi, 7712) a weight for tt, and 
v 7^ a corresponding weight vector. Then 

n(Hi)ir(Z a )v = (mi + ai)ir(Z a )v 

ir(H 2 )Tr(Z a )v = (m 2 + a 2 )Tr(Z a )v. 

Thus either Tt(Z a )v — or else Tr(Z a )v is a new weight vector with weight 

A* + a = (mi + ai , m 2 + a 2 ) . 

Proof. The definition of a root tells us that we have the commutation relation 
[HiZ a ] = u\Z a . Thus 

7r(Hi)Tr(Z a )v = (Tr(Z a )Tr(Hi) + ain(Z a )) v 
= n(Z a )(m 1 v) + a 1 n(Z a )v 
— (mi + cti)-K{Z a )v. 

A similar argument allows us to compute Tt(H2)ir(Z a )v. □ 

3. Highest Weights and the Classification Theorem 

We see then that if we have a representation with a weight /i = (mi, m 2 ), then 
by applying the root vectors X\, X 2 , X 3 , Yi, Y 2 , 13 we can get some new weights of 
the form jj, + a, where a is the root. Of course, some of the weight vectors may 
simply give zero. In fact, since our representation is finite-dimensional, there can 
be only finitely many weights, so we must get zero quite often. By analogy to 
the classification of the representations of si (2; C), we would like to single out in 
each representation a "highest" weight, and then work from there. The following 
definition gives the "right" notion of highest. 
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Definition 6.9. Let = (2,-1) and = (—1,2) be the roots introduced 



in (6. J))- Let n\ and [Xi be two weights. Then fii is higher than fj,i (or equivalently, 
/i2 is lower than if [i\ — \i 2 can be written in the form 

Mi — M2 = aa' 1 ' + ba^ 

with a > and b > 0. This relationship is written as fi% >z \i% or ^ A*l- 

If it is a representation o/sl (3; C), then a weight fio for it is said to be a highest 
weight if for all weights h of it, \x -< \xq. 

Note that the relation of "higher" is only a partial ordering. That is, one can 
easily have /ii and /i 2 such that \i\ is neither higher nor lower than /i 2 . For example, 

— is neither higher nor lower than 0. This in particular means that a finite 
set of weights need not have a highest element. (E.g., the set {0, — a^ 2 ^} has 
no highest element.) 

We are now ready to state the main theorem regarding the irreducible repre- 
sentations of si (3; C). 

Theorem 6.10. 1. Every irreducible representation 7r o/sl (3;C) is the di- 
rect sum of its weight spaces. That is, tt(Hi) and n(Hi) are simultaneously 
diagonalizable. 

2. Every irreducible representation o/sl(3;C) has a unique highest weight fio, 
and two equivalent irreducible representations have the same highest weight. 

3. Two irreducible representations o/sl (3; C) with the same highest weight are 
equivalent. 

4. If tt is an irreducible representation o/sl(3;C), then the highest weight fio 
of it is of the form 

Ho = (mi,m 2 ) 

with mi and mi non-negative integers. 

5. Conversely, if mi and mi are non-negative integers, then there exists a 
unique irreducible representation tt of si (3; C) with highest weight (J,q = 
(mi,m 2 ). 

Note the parallels between this result and the classification of the irreducible 
representations of sl(2;C): In each irreducible representation of sl(2;C), n(H) is 
diagonalizable, and there is a largest eigenvalue of ir(H). Two irreducible repre- 
sentations of sl(2;C) with the same largest eigenvalue are equivalent. The highest 
eigenvalue is always a non-negative integer, and conversely, for every non-negative 
integer m, there is an irreducible representation with highest eigenvalue m. 

However, note that in the classification of the representations of si (3; C) the 
notion of "highest" does not mean what we might have thought it should mean. 
For example, the weight (1, 1) is higher than the weights (—1,2) and (2,-1). (In 
fact, (1, 1) is the highest weight for the adjoint representation, which is irreducible.) 

It is possible to obtain much more information about the irreducible represen- 
tations besides the highest weight. For example, we have the following formula for 
the dimension of the representation with highest weight (mi, mi). 

Theorem 6.11. The dimension of the irreducible representation with highest 
weight (mi, mi) is 

~(mi + l)(m 2 + l)(mi + m 2 + 2). 
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We will not prove this formula. It is a consequence of the "Weyl character 
formula." See Humphreys, Section 24.3. Humphreys refers to si (3;C) as A 2 . 

4. Proof of the Classification Theorem 

It will take us some time to prove Theorem The proof will consist of a series 
of Propositions. 

Proposition 6.12. In every irreducible representation (tt, V) o/sl (3; C) , 7r(ifi) 
and tt(H2) can be simultaneously diagonalized. That is, V is the direct sum of its 
weight spaces. 

Proof. Let W be the direct sum of the weight spaces in V. Equivalently, W is 
the space of all vectors w € V such that w can be written as a linear combination of 



simultaneous eigenvectors for n(Hi) and tt(H2). Since (Proposition 6.4) 7r always 
has at least one weight, W ^ {0}. 

On the other hand, Lemma |6.8| tells us that if Z a is a root vector corresponding 
to the root a, then ir(Z a ) maps the weight space corresponding to [i into the weight 
space corresponding to n + a. Thus W is invariant under the action of all of the root 
vectors, namely, under the action Xi, X 2 , X 3 ,Yi,Y 2 , and Y3. Since W is certainly 
invariant under the action of Hi and H 2 , W is invariant. Thus by irreducibility, 
W = V. □ 

Definition 6.13. A representation (tv,V) o/sl(3;C) is said to be a highest 
weight cyclic representation with weight fio = (mi,mj) if there exists b ^ 
in V such that 

1. v is a weight vector with weight [j,q. 

2. tt(Xi> = ir(X 2 )v = 0. 

3. The smallest invariant subspace of V containing v is all of V . 
The vector v is called a cyclic vector for n. 

Proposition 6.14. Let (tt, V) be a highest weight cyclic representation o/sl (3; C) 
with weight fiQ. Then 

1. 7r has highest weight 

2. The weight space corresponding to the highest weight /j,q is one-dimensional. 
4.0.1. Proof. 

Proof. Let v be as in the definition. Consider the subspace W of V spanned 
by elements of the form 

(6.5) w=ir(Y ix )Tr(Y i2 )---7r(Y in )v 



with each i; = 1, 2, and n > 0. (If n = 0, it is understood that w in ( |6.5| ) is equal to 
v.) I assert that W is invariant. To see this, it suffices to check that W is invariant 
under each of the basis elements. 

By definition, W is invariant under 7r(Yi) and ^(l^)- It is thus also invariant 
under 7^3) = - kfti), tt(Y 2 )]. 

Now, Lemma |6~§| tells us that applying a root vector Z a S si (3; C) to a weight 
vector v with weight fi gives either zero, or else a new weight vector with weight fj,+ 
a. Now, by assumption, v is a weight vector with weight /io- Furthermore, Y\ and 
Y 2 are root vectors with roots — — (—2,1) and — — (1,-2), respectively. 
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(See Equation (6.3).) Thus each application of 7r(Yi) or ^(Yi) subtracts or 
q,( 2 ) f rom the weight. In particular, each non-zero element of the form (6.5) is a 
simultaneous eigenvector for tt{H{) and tt(H2). Thus W is invariant under w(Hi) 
and 7r(iJ 2 ). 

To show that W is invariant under ir(Xi) and 7r(X 2 ), we argue by induction on 
n. For n = 0, we have n(Xi)v = n(X 2 )v — € W. Now consider applying 7r(Xi) 
or 7r(X 2 ) to a vector of the form ( |6.5| ). Recall the commutation relations involving 
an ^! or X 2 and a Yi or Y 2 : 

[x x ,yi] = ffi [Xi,y 2 ] = o 
[x 2 ,Yi] = o [x 2 ,y 2 ] = ff. 



2 • 



Thus (for z and j equal to 1 or 2) w(Xi)n(Yj) = w(Yj)ir(Xi) +ir(Hij), where Hij is 
either Hi or i? 2 or zero. Hence (for i equal to 1 or 2) 

n(X i )ir{Y il )Tr{Y i2 )---Tr{Y in )v 
= u(Y n ) 7 r(X i )7r(y 2 ) ■ ■ • vr(y„ )« + 7r(i/ y >(y 2 ) • • ■ tt(Y„ )«. 

But 7r(Xi)7r(y 2 ) ■ • ■ir(Yi n )v is in W by induction, and 7r(-ffy)7r(y 2 ) • • •7t(Y ?i )d is 
in W since IV is invariant under Tr(-ffi) and 7r(iJ 2 ). 

Finally, Ty is invariant under 77(^3) since 71(^3) = [7r(Xi), 7r(X 2 )]. Thus W is 
invariant. Since by definition W contains v, we must have W — V. 

Since Y\ is a r oot vector with root — a^ 1 -* and Y 2 is a ro ot vector with root 
— oS- 2 \ Lemma |6.8| tells us that each element of the form (S.5) is either zero or a 
weight vector with weight (1q — a^ 11 ' — ■ ■ ■ — a^ 1 "'. Thus V = W is spanned by v 
together with weight vectors with weights lower than fj,Q- Thus /iq is the highest 
weight for V. 

Furthermore, every element of W can be written as a multiple of v plus a linear 
combination of weight vectors with weights lower than /iq. Thus the weight space 
corresponding to ^0 is spanned by v; that is, the weight space corresponding to [Xq 
is one-dimensional. □ 



Proposition 6.15. Every irreducible representation o/sl(3;C) is a highest 
weight cyclic representation, with a unique highest weight /io- 

Proof. Uniqueness is immediate, since by the previous Proposition, /j, is the 
highest weight, and two distinct weights cannot both be highest. 

We have already shown that every irreducible representation is the direct sum 
of its weight spaces. Since the representation is finite-dimensional, there can be 
only finitely many weights. It follows that there must exist a weight /io such that 
there is no weight /x 7^ /io with (i y /io. This says that there is no weight higher 
than /io (which is not the same as saying the /io is highest). But if there is no 
weight higher than /io, then for any non-zero weight vector v with weight //q, we 
must have 

n{Xi)v = tt(X 2 )v = 0. 

(For otherwise, say, n(Xi)v will be a weight vector with weight fio + o^ 1 ^ >~ Mo-) 

Since 7r is assumed irreducible, the smallest invariant subspace containing v 
must be the whole space; therefore the representation is highest weight cyclic. 
□ □ 
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Proposition 6.16. Every highest weight cyclic representation o/sl (3;C) is ir- 
reducible. 

Proof. Let (tt, V) be a highest weight cyclic representation with highest weight 
Hq and cyclic vector v. By complete reducibility (Proposition 6.2), V decomposes 
as a direct sum of irreducible representations 



(6.6) 



By Proposition 6.12 , each of the V^'s is the direct sum of its weight spaces. 
Thus since the weight (mq occurs in V, it must occur in some Vi. On the other hand, 
Proposition 3.14 says that the weight space corresponding to fi is one-dimensional, 
that is, v is (up to a constant) the only vector in V with weight /xo- Thus V, must 
contain v. But then that Vi is an invariant subspace containing v, so Vi = V. Thus 
there is only one term in the sum (3.6), and V is irreducible. □ 



Proposition 6.17. Two irreducible representations o/sl(3;C) with the same 
highest weight are equivalent. 

PROOF. We now know that a representation is irreducible if and only if it is 
highest weight cyclic. Suppose that (n, V) and (a, W) are two such representations 
with the same highest weight /io- Let v and w be the cyclic vectors for V and 
W , respectively. Now consider the representation V © W, and let U be smallest 
invariant subspace of V © W which contains the vector (v, w). 

By definition, U is a highest weight cyclic representation, therefore irreducible 



by Proposition. 6.16. Consider the two "projection" maps Pi : V © W — > V, 
Pi(v, w) = v and Pi : V © W — > W, Pi(v, w) = w. It is easy to check that Pi and 
Pi are morphisms of representations. Therefore the restrictions of Pi and Pi to 
U C V © W will also be morphisms. 

Clearly neither P\\ v nor Pi\jj is the zero map (since both are non-zero on 
(v,w)). Moreover, U, V, and W are all irreducible. Therefore, by Schur's Lemma, 
Pi\u is an isomorphism of U with V, and Pi\u is an isomorphism of U with W. 
ThusF = U ^W. □ 



Proposition 6.18. If n is an irreducible representation o/sl(3;C), then the 
highest weight of n is of the form 

/! = (mi,m 2 ) 
with mi and mi non-negative integers. 

Proof. We already know that all of the weights of ir are of the form (mi , mi), 
with mi and mi integers. We must show that if fio — (mi , mi) is the highest weight, 
then mi and mi are both non-negative. For this, we again use what we know about 
the representations of si (2; C). The following result can be obtained from the proof 
of the classification of the irreducible representations of si (2; C). 

Let (it, V) be any finite-dimensional representation of sl(2;C). Let v be an 
eigenvector for w(H) with eigenvalue A. If ir(X)v = 0, then A is a non-negative 
integer. 

Now, if 7r is an irreducible representation of si (3; C) with highest weight /i = 
(mi, mi), and if v ^ is a weight vector with weight then we must have 
ir(Xi)v = ir(X 2 )v — 0. (Otherwise, no wouldn't be highest.) Thus applying the 
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above result to the restrictions of tt to {Hi,X%, Y±} and to {H2, X2, Y2} shows that 
mi and 7712 must be non-negative. □ 

Proposition 6.19. If mi and 7712 are non-negative integers, then there exists 
an irreducible representation of si (3;C) with highest weight \i — (7711,7712). 

Proof. Note that the trivial representation is an irreducible representation 
with highest weight (0, 0). So we need only construct representations with at least 
one of 777 1 and 777 2 positive. 

First, we construct two irreducible representations with highest weights (1,0) 
and (0, 1). (These are the so-called fundamental representations.) The stan- 
dard representation of si (3; C) is an irreducible representation with highest weight 
(1,0), as is easily checked. To construct an irreducible representation with weight 
(0, 1) we modify the standard representation. Specifically, we define 

(6.7) n(Z) = -Z tr 

for all Z e si (3; C). Using the fact that (AB) tr = B tr A tr , it is easy to check that 

-\z x ,z % t = {-z%-z%\ 

so that 7r is really a representation. (This is isomorphic to the dual of the standard 
representation, as defined in Exercise [14] of Chapter 5.) It is easy to see that 7r is 
an irreducible representation with highest weight (0, 1). 

Let (-7Ti, Vi) denote C 3 acted on by the standard representation, and let vi de- 
note a weight vector corresponding to the highest weight (1, 0). (So, v x — (1,0, 0).) 
Let (^2,14) denote C 3 acted on by the representation (|6.7[ ), and let v 2 denote a 
weight vector for the highest weight (0, 1). (So, v% = (0,0, 1).) Now consider the 
representation 

Vi ® Vi ■ ■ ■ <g> Vi <g> V 2 ® V2 • • • V% 

where V\ occurs mi times, and V 2 occurs 7712 times. Note that the action of si (3; C) 
on this space is 

Z -> {-K\ (Z) I ■ • • ® J) 

(6.8) + (I <8> 7Ti (Z) ® / • • • ® 7) + • • • + (I ® • • • I <g> tt 2 (Z)) . 

Let 7r mijm2 denote this representation. 
Consider the vector 

V mi ,m 2 = Vl ® Vi ■ ■ ■ ® Vl ® V2 ® V2 ■ ■ ■ <g> V2- 



Then applying (6.8) shows that 

^mi .ma 0^1 )^mi .m2 — ^l^mi,m2 
^mi ,m2 (-^2 J^mi ,m2 — ^2^mi,m2 
7Tmi,m2 (-^"1)^7711 ,m2 ^ 
(6.9) TTmi ,7712(^2)^7711,70.2 = 0. 

Now, the representation 7r mijm2 is not irreducible (unless (7771,7772) = (1,0) or 
(0, 1)). However, if we let W denote the smallest invariant subspace containing the 
vector v mi , m2 , then in light of (|6.9]), W will be highest weight cyclic with highest 
weight (7771,7772)- Therefore by Proposition |6.16 , W is irreducible with highest 
weight (mi, 777,2). 

Thus W is the representation we want. □ 
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We have now completed the proof of Theorem [j]. 

5. An Example: Highest Weight (1,1) 

To obtain the irreducible representation with highest weight (1,1) we are sup- 
posed to take the tensor product of the irreducible representations with highest 
weights (1,0) and (0,1), and then extract a certain invariant subspace. Let us 
establish some notation for the representations (1,0) and (0,1). In the standard 
representation, the weight vectors for 



H = -1 ; H 2 = 





are the standard basis elements for C 3 , namely, ei, e 2 , and e 3 . The corresponding 
weights are (1,0), (—1, 1), and (0, —1). The highest weight is (1,0). 
Recall that 

Yx 

Thus 
























i 












: 









V o 


1 







Yx(ex) = 


e 2 


Y 2 (ex) = 





(6.10) 


Yx{e 2 ) = 





Y 2 (e 2 ) = 






Yx(e 3 ) = 





Y 2 (e 3 ) = 


0. 



Now, the representation with highest weight (0, 1) is the representation ir(Z) 
-Z tr , for Z e si (3; C). Let us define 

Z = -Z tr 

for all Z € si (3; C). Thus tt(Z) = ~Z. Note that 

-10 





Hi I • ) // 

The weight vectors are again ex, e 2 , and e%, with weights (—1, 0), (1, —1), and (0, 1). 
The highest weight is (0, 1). 



Define new basis elements 



Then since 



we have 



fx = e 3 
h = ~ e 2 
fz = ex- 



Yx = ; Y 2 = 





Mfx) = Y^fx) = h 

(6.H) Yx(h) = h Mfc) = 

Yx(f 3 ) = Y 2 (f 3 ) = 0. 

Note that the highest weight vector is fx = e 3 . 
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So, to obtain an irreducible representation with highest weight (1, 1) we are 
supposed to take the tensor product of the representations with highest weights 
(1, 0) and (0, 1), and then take the smallest invariant subspace containing the vector 



e\ ® f\. In light of the proof of Proposition 3.14, this smallest invariant subspace 
is obtained by starting with e\ ® fx and applying all possible combinations of Yl 
and Y 2 . 

Recall that if ttx and 7r 2 are two representations of the Lie algebra si (3; C), then 

(7T1 <g> 7T 2 ) (Yi) =7Tl(Yi) ®/ + /®7T2(Yl) 

(tti ® tt 2 ) (Y 2 ) =7ri(Y 2 )<8>/ + I®n 2 {Y 2 ). 
In our case we want 7Ti(Y) = Y and 7r 2 (Y) = Y. Thus 

(tti <g> 7r 2 ) (Yi) = Yi ® I + 1 <g> FX 

(7Tl <g> 7T 2 ) (Y 2 ) = Y 2 ® I + I ® %. 



The actions of Y and Y are described in ( 3.10 ) and ( 6.11 ). 

Note that 7Ti <g) 7r 2 is nof an irreducible representation. The representation 
tti <E)7r 2 has dimension 9, whereas the smallest invariant subspace containing e\ ® fx 
has, as it turns out, dimension 8. 

So, it remains only to begin with ex (8> /i, apply Yi and Y 2 repeatedly until we 
get zero, and then figure out what dependence relations exist among the vectors we 
get. These computations are done on a supplementary page. Note that the weight 
(0,0) has multiplicity two. This is because, starting with ex ® /i, applying Y\ and 
then Y 2 gives something different than applying Y 2 and then Yl. 

6. The Weyl Group 

The set of weights of an arbitrary irreducible representation of si (3; C) has a 
certain symmetry associated to it. This symmetry is in terms of something called 
the "Weyl group." (My treatment of the Weyl group follows Brocker and torn 
Dieck, Chap. IV, 1.3.) We consider the following subgroup of SU(3): 



wo = U 1 U ; wx = 1 U U ; w 2 



W = < 




W3 = — i u u ; u> 4 = — u i u ; w 5 





These are simply the matrices which permute the standard basis elements of C 3 , 
with an adjustment of overall sign when necessary to make the determinant equal 
one. 

Now, for any A 6 SU(3), we have the associated map AdA : su(3) — > su(3), 
where 

AdA(X) = AX A" 1 . 

Now, since each element of si (3; C) is of the form Z = X + iY with X, Y £ su(3), 
it follows that si (3;C) is invariant under the map Z — > AZA^ 1 . That is, we can 
think of Ad^4 as a map of si (3; C) to itself. 

The reason for selecting the above group is the following: If w £ W, then 
Adw(Hx) and Adw(H 2 ) are linear combinations of Hx and H 2 . That is, each 
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Adw preserves the space spanned by H± and Hi- (There are other elements of 
SU(3) with this property, notably, the diagonal elements. However, these actually 
commute with Hi and Hi- Thus the adjoint action of these elements on the span 
of Hi and Hi is trivial and therefore uninteresting. See Exercise ||.) 

Now, for each w G W and each irreducible representation tt of si (3;C), let's 
define a new representation tt w by the formula 

tt w (X) = tt (Adw- l (X)) = niw^Xw). 

Since Adw -1 is a Lie algebra automorphism, tt w will in fact be a representation of 
sl(3;C). 

Recall that since SU(3) is simply connected, then for each representation tt of 
si (3; C) there is an associated representation II of SU(3) (acting on the same space) 
such that 

Il(e x ) =e*W 

for all X G su(3) C si (3; C). The representation II has the property that 

(6.12) tt(AXA- 1 ) = n(^) 7 r(x)n(A)- 1 

for all X G su(3). Again since ever y element of si (3; C) is of the form X + iY with 
X, Y G su(3), it follows that ( jsl3 ) holds also for X G si (3; C). 
In particular, taking A — w^ 1 G W we have 

(6.13) n w (X) = niw^Xw) = IL(w)- 1 tt(X)IL(w) 
for aRXe sl(3;C). 

Proposition 6.20. For each representation tt o/sl (3;C) and for each w G W, 
the representation tt w is equivalent to the representation tt. 

Proof. We need a map <f> :V —>V with the property that 

4>{TT W {X)V) =TT{X)cp{v) 

for all v G V. This is the same as saying that cfm w (X) — tt{X)4>, or equivalently 
that tt w (X) — <t>^ 1 TT(X)<j). But in light of (3.12), we can take <f> = H(w). □ 



Although tt and tt w are equivalent, they are not equal. That is, in general 
tt(X) ^ tt w (X). You should think of tt and tt w as differing by a change of basis on 
V, where the change-of-basis matrix is II(w). Two representations that differ just 
by a change of basis are automatically equivalent. 

Corollary 6.21. Let tt be a representation of si (3; C) and w G W. Then 
a pair fi — (mi,TO2) is a weight for tt if and only if it is a weight for tt w . The 
multiplicity of /x as a weight of tt is the same as the multiplicity of fi as a weight 
for tt w . 



Proof. Equivalent representations must have the same weights and the same 
multiplicities. □ 
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Let us now compute explicitly the action of Adw 1 on the span of Hi and H 2 , 
for each w € W. This is a straightforward computation. 



(6.14) 



We can now see the significance of the Weyl group. Let ir be a representation 
of si (3; C), /1 = (mi, m 2 ) a weight, and u^Oa weight vector with weight /i. Then, 
for example, 

7r Wl (Hi)v = tt(wi HiW\)v = n(—Hi - H 2 )v = (—mi - m 2 )v 
k Wi {H 2 )v = ■k(wi 1 H 2 wi)v = n(Hi)v = miv. 
Thus v is a weight vector for ir w with weight (—mi — m 2 ,mi). But by Corollary 





^iWo = 


Hi 




« ! 3 






Hi 




1 H 2 w = 


Hi 




W3 


1 H 2 w 3 


= Hi 


+ H 2 


w l 


l HiWi = 


-Hi - 


H 2 


W~ 


1 H 1 w 4 , 




Hi 


w l 


l H 2 wi = 


Hi 






1 H 2 wa 




Hi 


•U>2 


l H x w 2 = 


H 2 






l H lW5 


= Hi 


+ H 2 


»<2 


1 H 2 w 2 = 


-Hi - 


Hi 


"'5 






H 2 . 



5.21, the weights of n and of tt w are the same! 

Conclusion: If fi — (mi, m 2 ) is a weight for n, so is (—mi —m 2 , mi). 

The multiplicities of (mi,m 2 ) and (—mi — m 2l mi) are the same. 
Of course, a similar argument applies to each of the other elements of the 
Weyl group. Specifically, if /i is a weight for some representation w, and w is an 
element of W, then there will be some new weight which must also be a weight 
of 7r. We will denote this new weight w ■ /i. For example, if /1 = (mi,m 2 ), then 
W\- ji = (—mi — m 2 , mi). (We define w ■ [i so that if v is a weight vector for ir with 



weight /i, then v will be a weight for n w with weight w ■ fi.) From (3.14) we can 
read off what w ■ [i is for each w. 

(6.15) 

w -(mi,m 2 ) = (mi,m 2 ) w 3 -(mi,m 2 ) = (-mi,mi+m 2 ) 

wi-(mi,m 2 ) = (-mi-m 2 ,mi) wi-(mi,m 2 ) = (-m 2 ,-mi) 

w 2 -(mi,m 2 ) = (m 2 ,-mi-m 2 ) w 5 -(mi,m 2 ) = (mi+m 2 ,-m 2 ) 

It is straightforward to check that 

(6.16) Wi ■ (wj ■ fi) = (wiWj) ■ fi. 

We have now proved the following. 

Theorem 6.22. If fi = (mi,m 2 ) is a weight and w is an element of the Weyl 
group, let w ■ fi be defined by ( 6.1b\j . If ir is a finite- dimensional representation 



of si (3; C), then /1 is a weight for ir if and only if w ■ fi is a weight for ir. The 
multiplicity of fi is the same as the multiplicity of w ■ /1. 

If we think of the weights fi = (mi,m 2 ) as sitting inside R 2 , then we can 
think of (6.15) as a finite group of linear transformations of M. 2 . (The fact that 



this is a group of transformations follows form ( |6.16j ).) Since this is a finite group 
of transformations, it is possible to choose an inner product on R 2 such that the 
action of W is orthogonal. (As in the proof of Proposition |5.16 in Chapter 5.) 



In fact, there is (up to a constant) exactly one such inner product. In this inner 



product, the action (6.15) of the Weyl group is generated by a 120° rotation and a 



7. COMPLEX SEMISIMPLE LIE ALGEBRAS 



115 



reflection about the y-axis. Equivalently, the Weyl group is the symmetry group of 
an equilateral triangle centered at the origin with one vertex on the y-axis. 

7. Complex Semisimple Lie Algebras 

This section gives a brief synopsis of the structure theory and representation 
theory of complex semisimple Lie algebras. The moral of the story is that all such 
Lie algebras look and feel a lot like sl(3;C). This section will not contain any 
(non-trivial) proofs. 

If g is a Lie algebra, a subspace I C g is said to be an ideal if [X, Y] G I for 
all X G g and all Y G I. A Lie algebra g is a said to be simple if dimg > 2 and g 
has no ideals other than {0} and g. A Lie algebra g is said to be semisimple if g 
can be written as the direct sum of simple Lie algebras. 

In this section we consider semisimple Lie algebras over the complex numbers. 
Examples of complex semisimple Lie algebras include si (n; C), so(n; C) (n > 3), and 
sp(n;C). All of these are actually simple, except for so(4;C) which is isomorphic 
to sl(2;C)0sl(2;C). 

Definition 6.23. Let g be a complex semisimple Lie algebra. A subspace f) of 
g is said to be a Cartan subalgebra if 

1. f) is abelian. That is, [Hi, H%[ = for all Hi, Hi G f). 

2. f) is maximal abelian. That is, if X G g satisfies [H,X] — for all H G t), 
then lei). 

3. For all H G I), &dH : g — > g is diagonalizable. 

Since all the H's commute, so do the adH's. (I.e., [adffi, ad^] = ad [Hi, H?\ = 
0.) By assumption, each ad-ff is diagonalizable, and they commute, therefore the 
adiJ's are simultaneously diagonalizable. (Using a standard linear algebra fact.) 
Let f)* denote the dual of (), namely, the space of linear functionals on t). 

Definition 6.24. If g is a complex semisimple Lie algebra and t) a Cartan 
subalgebra, then an element a off)* is said to be a root (for g with respect to t)) if 
a is non-zero and there exists Z ^ in g such that 

(6.17) [H,Z]=a(H)Z 

for all H G f). (Thus a root is a non-zero set of simultaneous eigenvalues for the 
adH 's.) 

The vector Z is called a root vector corresponding to the root a, and the space 



of all Z G g satisfying (6.17) is the root space corresponding to a. This space is 
denoted g Q . 

The set of all roots will be denoted A. 

Note that if g = sl(3;C), then one Cartan subalgebra is the space spanned 
by Hi and H^ . The roots (with respect to this Cartan subalgebra) have been 
calculated in 



Theorem 6.25. If g is a complex semisimple Lie algebra, then a Cartan subal- 
gebra f) exists. Ifi)i and t)2 are two Cartan subalgebras, then there is an automor- 
phism of g which takes f)i to f)2- In particular, any two Cartan subalgebras have 
the same dimension. 



From now on, g will denote a complex semisimple Lie algebra, and f) a fixed 
Cartan subalgebra in g. 
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Definition 6.26. The rank of a complex semisimple Lie algebra is the dimen- 
sion of a Cartan subalgebra. 

For example, the rank of si (n; C) is n — 1. One Cartan subalgebra in si (n; C) 
is the space of diagonal matrices with trace zero. (Note that in the case n = 3 
the space of diagonal matrices with trace zero is precisely the span of Hi and H 2 .) 
Both so(2n; C) and so(2n + 1; C) have rank n. 

Definition 6.27. Let (tt, V) be a finite- dimensional, complex-linear represen- 
tation of g. Then n S f)* is called a weight for ir if there exists u / » F such 
that 

tt(H)v = fi{H)v 

for all H € t). The vector v is called a weight vector for the weight fi. 

Note that the roots are precisely the non-zero weights for the adjoint represen- 
tation. 

LEMMA 6.28. Let a be a root and Z a corresponding root vector. Let fi be a 
weight for a representation it and v a corresponding weight vector. Then either 
tt(Z)v = or else n(Z)v is a weight vector with weight fj, + a. 

Proof. Same as for si (3; C). □ 

Definition 6.29. A set of roots {ai, •••«;} is called a simple system (or 
basis) if 

1. {cui, • • • a{\ is a vector space basis for f)*. 

2. Every root a e A can be written in the form 

a = n x a x + n 2 a 2 H h n;a; 

with each rii an integer, and such that the nt 's are either all non-negative or 
all non-positive. 

A root a is said to be positive (with respect to the given simple system) if the 
rii 's are non-negative; otherwise a is negative. 

If q — si (3; C) and () = {Hi,H 2 }, then one simple system of roots is {a^\a^ } = 
{(2, — 1) , (— 1, 2)} (with the corresponding root vectors being X\ and^). The posi- 
tive roots are {(2, -1) , (-1,2) , (1, 1)}. The negative roots are {(-2, 1) , (1,-2) , (-1, 

Definition 6.30. Let {ai, ■ ■ ■ a{\ be a simple system of roots and let \i\ and 
[i 2 be two weights. Then \x\ is higher than [i 2 (or [i 2 is lower than \i\) if \i\ — \i 2 
can be written as 

Hi — H2 = aiai + a 2 a 2 H h aim 

with ai > 0. This relation is denoted /ii >z ^ 2 or /i 2 -< fii. 

A weight /i for a representation ir is highest if all the weights \x of ir satisfy 

The following deep theorem captures much of the structure theory of semisimple 
Lie algebras. 

Theorem 6.31. Let g be a complex semisimple Lie algebra, f) a Cartan subal- 
gebra, and A the set of roots. Then 

1. For each root a e A, the corresponding root space g a is one-dimensional. 
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2. If a is a root, then so is —a. 

3. A simple system of roots {a\, • • • a{\ exists. 

We now need to identify the correct set of weights to be highest weights of 
irreducible representations. 

Theorem 6.32. Let {ct\, • • • a{\ denote a simple system of roots, Xi an element 
of the root space Q ai and Yi an element of the root space Q~ ai . Define 

Hi = [X^Yi]. 

Then it is possible to choose Xi and Yi such that 

1. Each Hi is non-zero and contained in f). 

2. The span of {Hi, Xi,Yi} is a subalgebra of q isomorphic (in the obvious way) 
tosl(2;C). 

3. The set {Hi, ■ ■ ■ Hi} is a basis for t). 

Note that (in most cases) the set of all H^s, X^s, and Y's (i = 1,2, • • • I) do 
not span g. In the case g = si (3; C), I — 2, and the span of H\,X\, Y\, Hi,Xi, Yi 
represents only six of the eight dimensions of si (3; C). Nevertheless the subalgebras 
{Hi, Xi,Yi} play an important role. 

We are now ready to state the main theorem. 

Theorem 6.33. Let g be a complex semisimple Lie algebra, fj a Cartan subal- 
gebra, and {ct\, ■ ■ ■ a{\ a simple system of roots. Let {Hi, ■ ■ ■ H{\ be as in Theorem 



6. 3i . Then 

1. In each irreducible representation tt of q, the ir(H)'s are simultaneously di- 
agonalizable. 

2. Each irreducible representation of q has a unique highest weight. 

3. Two irreducible representations of q with the same highest weight are equiv- 
alent. 

4. // /io is the highest weight of an irreducible representation of Q, then for 
i = 1,2, ■ ■ ■ I, fio(Hi) is a non-negative integer. 

5. Conversely, if /io £ I)* is such that fj,o(Hi) is a non-negative integer for all 
i = 1,2,- ■■ I, then there is an irreducible representation of Q with highest 
weight /Iq. 

The weights fiQ as in 4) and 5) are called dominant integral weights. 



8. Exercises 

1. Show that for any pair ofnxn matrices X and Y, 

[X tr ,Y tr ] =-[X,Yf. 

Using this fact and the fact that X\ r = Yi for i = 1, 2, 3, explain the symme- 
try between X's and F's in the commutation relations for si (3; C). For exam- 
ple, show that the relation [Yi, Y2] = — Y3 can be obtained from the relation 
[Xl,X2] = ^3 by taking transposes. Show that the relation \Hi,Y2\ = Y2 
follows from the relation [Hi,X2\ — — Xi- 

2. Recall the definition of the dual 7r* of a representation ir from Exercise [l4| 
of Chapter 5. Consider this for the case of representations of si (3; C). 

a) Show that the weights of n* are the negatives of the weights of it. 
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b) Show that if w is the irreducible representation of si (3; C) with highest 
weight (mi, m?) then ir* is the irreducible representation with highest weight 
(m 2 ,mi). 

Hint: If you identify V and V* by choosing a basis for V, then A tr is 
just the usual matrix transpose. 

Let f) denote the subspace of si (3;C) spanned by Hi and H 2 - Let G denote 
the group of all matrices A G SU(3) such that AdA preserves f). Now let Go 
denote the group of all matrices A E SU(3) such that Ad^4 is the identity 
on f), i.e., such that AdA(Ui) = Hi and AdA{H 2 ) = H 2 . Show that G is a 
normal subgroup of G. Compute G and Go- Show that G/Gq is isomorphic 
to the Weyl group W. 

a) Verify Theorems 3.31 and 6.32| explicitly for the case g = si (n;C). 

b) Consider the task of trying to prove Theorem |6 .33 for the case of 
si (n; C). Now that you have done (a), what part of the proof goes through 
the same way as for si (3; C)? At what points in the proof of the correspond- 
ing theorem for si (3; C) did we use special properties of si (3; C)? 

Hint: Most of it is the same, but there is one critical point which we do 
something which does not generalize to si (n;C). 
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Cumulative exercises 

1. Let G be a connected matrix Lie group, and let Ad : G — ► GL(g) be the 
adjoint representation of G. Show that 

ker(Ad) = Z(G) 

where Z(G) denotes the center of G. If G = 0(2), compute ker(Ad) and 
Z{G) and show that they are not equal. 

Hint: You should use the fact that if G is connected, then every A e G 
can be written in the form A = e Xl e X2 ■ ■ ■ e x ™ , with Xi E g. 

2. Let G be a finite, commutative group. Show that the number of equivalence 
classes of irreducible complex representations of G is equal to the number 
of elements in G. 

Hint: Use the fact that every finite, commutative group is a product of 
cyclic groups. 

3. a) Show that if R € 0(2), and deti? = —1, then R has two real, orthogonal 
eigenvectors with eigenvalues 1 and — 1. 

b) Let R be in 0(n). Show that there exists a subspace W of K™ which is 
invariant under both R and J? -1 , and such that dimTU = 1 or 2. Show that 
W 1 - (the orthogonal complement of W) is also invariant under R and R -1 . 
Show that the restrictions of R and to W and to W 1 - are orthogonal. 
(That is, show that these restrictions preserve inner products.) 

c) Let R be in 0(n). Show that R™ can be written as the orthogonal 
direct sum of subspaces W, such that 

(a) 1) Each Wi is invariant under R and J? -1 , 

(b) 2) Each Wi has dimension 1 or 2, and 

(c) 3) If dim Wi = 2, then the restriction of R to Wi has determinant one. 

d) Show that the exponential mapping for S0(n) is onto. Make sure 
you use the fact that the elements of S0(n) have determinant one. 

Note: This provides an alternative proof that the group S0(n) is con- 
nected. 

4. Determine, up to equivalence, all of the finite-dimensional, irreducible (complex- 
linear) representations of the Lie algebra si (2; C)©sl(2; C). Can your answer 
be expressed in terms of a sort of "highest weight" ? 

Hint: Imitate the proof of the classification of the irreducible represen- 
tations of sl(2; C). 

5. Consider the irreducible representation (ir, V) of si (3; C) with highest weight 
(0,2). Following the procedure in Chapter 6, Section 5, determine 

1) The dimension of V. 

2) All of the weights of w. 
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3) The multiplicity of each of the weights. (That is, the dimension of 
the corresponding weight spaces.) 
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